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Abstract 

The goal of this thesis is to apply the computational approach to motor learning, 
i.e., describe the const raints that enable performance improvement with experience 
and also the constraints that must be satisfied by a mo tor learning system, describe 
what is heing computed in order to achieve learning and why it is being computed. 
The particular tasks used to assess motor Learning are loaded and unloaded free 
arm movement , and the thesis includes work on rigid body Load estimation: arm 
model estimation, optimal filtering for model parameter estimation, and trajectory 
learning from practice. Learning algorithms have bten developed and implemented 
in the context of robot arm control. 

The thesis demonstrates some of the roles, of knowledge in learning. Powerful 
generaJi Bations can be made on the basis of knowledge of system structure, as is 
demonstrated in the load and arm model estimation algorithms, Improving thu 
performance of parameter estimation algorithms used in Learning involves knowledge 
of the measurement noise characteristics, as is shown in. the derivation of optimal 
(iltera. Using trajectory errors to correct commands requires knowledge of how 
command errors are transformed into performance errors 1 i.c. : an accurate model of 
'.he dynamics of the controlled system, as is demonstrated in the trajectory learning 
work. The performance demonstrated by the algorithms developed In this thesis 
should be compared with algorithms that use less knowledge, such as table based 
schemes to learn arm dynamics, previous single trajectory Learning algorithms, and 
much of traditional adaptive control, 
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Chapter 1 



Introduction 



Learning Is currently an important distinguishing; characteristic between biological 
systems and machines. Humans and animals tend to improve their motor perfor- 
mance with experience, while machines often repeat the same errors, day in and day 
out. An important step towards understanding ourselves, and also building more 
useful machines, is to begin to understand how learning is achieved. 

The goal of this thesis is to apply the computational approach (MaiT, lSfT; Marr, 
1962; HildrEth and 11 oiler hack, \9Rb) to motor learning. The point of view taken is 
that learning is an information processing problem: a learning algorithm modifies 
commands using information from previous experience so as to improve performance, 
A proposed methodology for computational neuroscience and artificial intelligence 
includes the following steps: (MftTT, l$17l Winston, L9S4) 

1. Identify a particular information processing; problem or task. 

2. Formulate a computational theory^ Expose constraints that enable [or restrict] 
performance, describe what is being computed and why, select or devise ap- 
propriate representations, define procedures to solve problem. 

:t. Devise and implement actual algorithms to solve problem. 



A. Verify and equate the performance of the implementation via experiments. 

The particular tasks used to assess motor learning are loaded and unloaded free 
arm movement. The computational theory developed applies model based feedfor- 
ward control to the arm control problem, and involves elements of control theory and 
system identification. In order to effectively use mode] based control one must have 
an accurate model, and the first half of the thesis describes how to build models of 
particular types of loads and arras, One constraint that la heavily relitd ujjou is that 
the arms and loads described in this thesis can be adequately modelled using rigid 
body dynamics. This allows us to use models with small numbers of parameters and 
to make powerful! generalisations between dissimilar movements. 

The implementation of the model building procedures reveals that Rood models 
can be identified quickly and are useful for control. However! the models used to 
represent the arm and Soad dynamics have .i mi ted decrees of freedom, and cannot 
represent the full complexity of the true dynamics, The second half of the thesis 
describes how ii command can be renned for a particular movement on the basis of 
practice, allowing representation of fine details of the dynamics. The computational 
theory for the learning from practice described here involves making explicit what 
the learning operator that maps performance errors to command corrections should 
be. The implementation reveals fast and effective trajectory learning. 

A major contribution of this thesis is to demonstrate same of the roles oF knowl- 
edge in learning. Powerful generalizations can be made on the basis of knowledge 
of system structure, as is demonstrated in the load and arm model estimation al- 
gorithms. Improving the performance of parameter estimation algorithms used in 
learning involves knowled^fi of the measurement noise characteristics, as IS shown 
in the derivation of optimal filters- Using trajectory errors to efficiently correct 
commands requires knowledge of how command errors are transformed into perfor- 
mance errors, ie. an accurate model of the dynamics of the controlled system, as 



is demonstrated In the trajectory learning work. The performance- demonstrated by 
the algorithms developed ill this thcaia should be compared with algorithms that use 
Icse knowledge, such aa table baaed schemes to learn arm dynamics, previous baaed 
trajectory learning algorithms, and much of traditional adaptive control. 

In order to evaluate the computational theory developed in this thesis learning 
algorithms have; been developed, and implemented in the context of robot arm con- 
trol. Although the experimental work in this thesis la restricted to work with robots, 
the insights gained have a much broader relevance. The adaptive mechanisms used 
to implement learning may differ in different domains, but at a computational level 
the issues and principles remain the same, 

1*1 Choices and assumptions 

1*1*1 Working definition of learning 

The working definition of learning for tbe purposes of this thesis Is improvement of 

performattit uath czpzTizntt, Since the exp Crimen Lai work involves only machines, 
motivational levels, development, and other factora that make analysing biological 
performance improvement difficult are not an issue. 

Two kinds of learning are studied. The first involves building and refimiig internal 
models of one's eelf and the external world, and using those model s. to generate 
the appropriate actuator commands, and to guide processing of sensory data. The 
second involves using internal models to transform performance errors- into command 
corrections, 

1.1.2 Focus on arm trajectory control 

The effector system we will focus on will be a mechanical robot arm, and we will 
mainly be concerned with Improving execution of free arm trajectories, either with or 



without a. load. The types of learning appropriate for this sort of task are probably 

different from the types of learning involved in riding a bicycle or juggling. The 
ideas, in this thesis will hopefully generalize and be useful in studying these other 
tasks, however. 

To provide a framework for what follows let us examine a typical robot control 
architecture. We can divide the robot control problem into deciding what to do 
{planning) and doing it [txecutian} . For current robots planning is usually accom- 
plished by a human programmer, although a goal of this ;md much other research ia 
to incrementally automate robot programming, 

Since we are concerned with robot trajectory execution^ a plan h a complete 
specification of the motion of the robot in some coordinate system. Often the plan 
is expressed in task coordinates, and in order to execute this plan we eonvert task 
coordinates to arm coordinates. This process is referred to as inverse kinematics. 

The next step in making the robot follow the desired trajectory is supplying 
appropriate commands to the actuators. The simplest approach to this is to generate 
these commands by measuring the dine re [ice between where the arm is and where 
it is supposed to be at any instant in time and using some function {usually a linear 
function) of this error as. the drive signal to the actuators. Thia is what is known 
as feedback control. This type of control is useful in compensating for unpredicted 
disturbances- 
One problem with pure feed hack control is that it requires errors in. order to 
generate any drive signals. We can avoid this problem by calculating our best guess 
p.5 to the appropriate command signals to apply to the actuators to follow the desired 
trajectory exactly. This precalculated command is added to the feedback command 
during execution of the motion. I will refer to applying a precalculated actuator 
command as command ff.fidforvrard control Or SJmply feedforward control. The process 
of calculating the appropriate actuator command from a specification of the desired 
robot motion is referred to as invtr&t dbnamits. 
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At this level of detail the same problems of coordinate transformation, predic- 
tion of appropriate actuator {muscle) commands h and compensating for unpredicted 
disturbances must be addressed by biological systems. 

A.& a starting point for our discussion dF learning, let us ask the question, "How 
might we go about building learning into this performance architecture?™ or equiv- 
alency (from the point of view of this thesis) "What mformatioia can be extracted 
ironi past experience to improve present aud future robot performance? B 

Making better plans; Planning can be improved in several wayE: the world 
models used in planning can be refined, better methods for solving the given task 
can be generated, and the planning methods, themselves can be changed- These 
processes are assumed to be independent of improving execution of a given plan, 
and will not be addressed in what follows, We will take the plan as fixed. 

Improving execution of fixed plana: The ihe=sis will :Vcu5 on execution 
and making the effector system obey a given plan more closely. All components 
of execution can be improved using experience. One way of imp roving the various 
coordinate transformations involved in robot control is to refine the kinematic model 
of the robot using measurements from redundant sensing &uch as vision of the robot 
tip and joint angle sensing. To improve feedforward control Wu could refine the 
dynamic model of the robot. We could also design a better Feedback controller using 
the past history of Controller actions. 

Assume kinematics is essentially perfect: We will assume our kinematic 
model of the robot is essentially perfect, a£ there is already much research going on 
in the area of kinematic calibration. 

Assume a fixed feedback controller: As there is currently much research in 
adaptive feedback control we will also take our feedback controller to be fixed. 

In a complete robot learning system plan learning, kinematic learning, adaptive 
feedback control, and the types oF learning discussed in this thesis will all occur 
simultaneously An important point to keep in mind is that they can he handled by 
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essentially independent modules. 

Focus on dynamics and control; In this thesis wc will discuss how to build 

and refine a model of robot dynamics to be used for predicting, the appropriate 
actuator commands to drive the robot (feedforward control). We will discuss how to 
handle certain types of loads. We will also discuss how to build a model or noise in 
sensory data, and how to optimally filter that noise in the model building process. 
In. addition we will show the ™le of the robot model 41s the learning operator during 
movement practice, 3e- the robot model transforms- trajectory following errors into 
feedforward command correct ions, 

1.2 Outline of thesis 

1-2.1 Format of thesis 

Many of the chapters are designed to he independent papere. The price of trying 
to write a thesis whose chapters are publishable is some repetition and seeming 
multiplicity of goals. 

1.2.2 Introduction and Previous Work 

This chapter attempts to give the reader an idea of where he ia going, why, and some 
of the landmarks to look for r Chapter 2, Previous Work b reviews some of the related 
reap p.'c h ■ Other related research is reviewed in the appropriate chapters. 

1.2.3 Arm and Load Identification 

Chapters 3, 4., and 5 describe building models for one^s own arm and any rigid 
body load. Appendix A describes the use of an identified model for the control of a 
[jarticular robot arm. 
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This work demonstrates the power of knowing the system structure for Learning. 
In. this case the knowledge of the rigid body dynamics structure dramatically reduces 
the number of parameters to be learned and increases the learning rate compared 
to tabular [earning schemes (Albun N 1975a, lS75b s Kalbert, 1977, 1978, llaiberl ami 
Horn, 19T&J. More importantly, knowing the system structure allows much more 
;"io r rV<-Tnl pen era": nations to be made. There are two constraints that can be used as 
sources of generalisation, The Gist is smoothness of the input/output transforma- 
tion. This constraint can be used to generalize between similar movements and Is 
relied on heavily by tabular learning approaches. The constraint used in this work, 
system structure. Is more powerful in that information gained From one movement 
can be used to guide quite dissimilar movements. 

The crucial insight of these chapters Ie that systems dominated by rigid body 
dynamics can be described by a relatively small set of inertial parameters that appear 
linearly in the dynamics equations. Estimating these parameters turns out to be 
simple, as the theory and implementations show. We start with load identification 
[Chapter 3] because thai it the simplest version of the story, and with very little 
additional work we apply the same ideas to arm, identification [Chapter 4). 

Tht : reliance nr; <=y?tftrn sVrucL'.in: li> achieve generalization slots have n jjric.c. 
however. The models do not represent well deviations from the assumed structure,, 
which are always present to some degree. Thus, on any particular trajectory ex- 
ecution a rigid body dynamics model will have small errors. Changing the model 
parameters to fit this trajectory more exactly will degrade performance on other 
trajectories. What ts required Is an additional level of modelling that allows repre- 
sentation of fine details of the dynamics. This correction model and how it is learned 
is described in Chapter 6, Single Trajectory Learning, 

Chapter 5, Optimal Filtering For Parameter Estimation, discusses the practi- 
cal issues of handling noisy measurements. How to optimally process the incoming 
sensory data is determined by models of the measurement noise. Improving the per- 
il 



formanee of parameter estimation algorithms used in learning involves knowledge oF 
the measurement noise characteristics, which are estimated from redundant sensing. 
Thus, the Learning elements themselves, learn, in this case by building better models 
of the sensors and their interaction with the vorld- 

An insight that simplifies system identification Is to use sensing to decouple dif- 
ferent system identification problems. We use a wrist force/ torque sensor that might 
allow us to estimate load parameters independently of the arm dynamics, for exam- 
ple, Mor* generally, covering an arm or any effector system with a sensory barrier 
such as a tactile sensing skin allows us to separate identifying internal system dynam- 
ics and identifying external system dynamics and disturbances Without a sensory 
barrier the combined system of arm plus external world may be too complex to 
identify robustly. Decoupling system identification problems allows different identi- 
fication procedures to be applied to the different subsystems. In the case of an arm 
with variable loada,. we expect the arm dynamics to be only slowly varying, while 
toad dynamics change rapidly as loads are picked up or put down. Widely different 
rates and types of system change call for different system identification algorithms 
to track the changes. Another reason For decoupling arm and load identification is 
that the arm structure is relatively constant while the dynamics of different loads 
can have quite different structures. Ami identification can be based on a fixed modet 
structure, while a complete load identification system must handle many different 
model structures. 

It la important to keep in mind the broad relevance of the model building ap- 
proach suggested here. It is. true that other sources of dynamics such as friction 
and actuator dynamics play an important roie in the dynamics of many current 
robot artiii, However, l) rigid body dynamics is often a source of coupling dynam- 
ics between the joints 2) Since friction and actuator dynamics are usually Isolated 
at particular joints their identification reduces to a single input single output mod- 
elling problem rather than the more difficult multiple input multiple output problem 
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solved here, and 3) tlic same process of hypothesising a model structure and esti- 
mating parameters can be used to identify friction and actuator dyuaiuics and can 
even be added to the linear parameter estimation algorithms used in this work, 

1.2.4 Single Trajectory Learning 

Chapter fi. Single Trajectory Learning, addresses the issue of using practice to im- 
prove the execution of a particular trajectory. The crucial insight of this chapter is 
that one must use an accurate model of the controlled system to make Senae of trajec- 
tory errors, ie, convert the errors into corrections to feedforward commands. With- 
out, an ac curate model attempts to improve trajectory performance will most likely 
degrade performance. This work demonstrates the role of knowledge in analysing 
past behavior and correcting previous mistakes, and should be compared to the 
proliferation of ad. hoc trajectory Learning schemes. 

We are able to mathematically analyse the effect of various proposed single tra- 
jectory learning algorithms, which tells US why one le an ting operator works better 
than another. The most important result is that the convergence rates of the algo- 
rithms are determined by the quality of the models used. We can put mathematical 
bounds on acceptable modelling error for '.he. linear case. 

The model used in the trajectory learning work is the identified arm model pre- 
sented in Chapter 4, Thua t starting with only knowledge of system strueturej we 
have demonstrated a system that can build a general model of itself after only three 
or four movements, and then can learn to execute any particular trajectory to almost 
the limits of the system repeatability in an additional three or four movements , 

1.2.5 Future research 

The final chapter discusses what to do neKt, This work suggests a program of 
research on the quantitative psychoph3 r sics oF motor learning. Rigid body load iden- 
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tification is a first step towards general force perception. The cjttsMisLQn of the 
trajectory ] Earning algorithm to improve performance on a group oF similar trajecto- 
ries is. di&eusstnL Also included IS a. discussion of performance simplification, which 
rcducEE the- need for detailed internal dynamic models by resirit ling pnsEtble motor 
performance. 
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Chapter 2 



Previous Work 



This chapter spells out the theoretical background and philosophy of our approach, 
and surveys the relevant engineering literature. This thesis addreaaea the following 
questions: 

1. Haw can we generate appropriate actuator command Signals to drive a con- 
troEled eyistem along some desired trajectory? 

2, How can we modify the actuator command Signals to improve trajectory execu- 
tion on the same and similar trajectories given data from previous, movements? 

This chapter describes the genera] approach taken and contrasts it with auproach.EE 
taken, by others. En summary, we 

1, Describe three basic types of controllers: 

(a) Feedback 

{b) Disturbance Feedforward 

(c) Command Feedforward 

and thtir rokis, 
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2. Describe why we have chosen to pursue command feedforward control , and 
specifically adaptive command feedforward control > and 

3. Describe previous work on adaptive command feedforward control. 

2.1 Introduction: Three types of control and adap- 
tive control 

The mot i valine control problem.: The areas of control theory that one is in- 
terested in are often shaped by the systems one desires to controh The emphasis 
here is on the control of biological and. robotic arms during the execution of free 
trajectories. 

Focusing on execution: We are taking a modular approach to motor learning, 
and have split learning into two parts: making better plans and following plans more 
closely. In this chapter we assume there Is a given plan or desired trajectory and our 
goal is to execute it as accurately aa possible, Thus we focus on the controller, the 
element which transforms plana into actuator commands, and how we might refine 
the controller with experience. Following plans more closely is the goal of adaptive 
control, the branch of control theory ttia,t addresses how to improve controllers based 
on experience. 

Three types of control: A control system can be divided into three parts: 
the feedback controller and two feedforward controllers (Ogata, 1970; Takahasht, 
RabinSj and Auslander, i970; Astromand Wittcnmark, 1964). Figure 2\ shows? the 
general form of such a system. 

Feedback control is any control action based on the actual state history of the 
controlled system, and feedforward control is \l '. nr her control actions (D'Azzo and 
HoupEs, 19GG; Astrom and Wittenmark, 1964), We can distinguish two types of 
feedforward control: feedforward commands may be based on measurements of Other 
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variables that affect the controlled system but are not affected by it (disturbance 
feedforward) or the commands may be based oit prediction* of the control signals 
necessary to drive the controlled system along some desired trajectory (command 
feedforward), 

Three types of adaptive control: Corresponding to the different parts of the 
controller there are different types of adaptive control. Adaptive feedback control 
involves adjusting the feedback con broiler, Adaptive feedforward control may involve 
adjusting gains from measured signals (adaptive disturbance feedforward control) or 
improving the models used for control signal prediction (adaptive command feedfor- 
ward control). We will focus on improving the models used for command feedforward 
control. 

2*1*1 Questions that have come up 

Generality of approach: Each component of the control system (plant, feed- 
back controller, disturbance feedforward, command feedforward) may be a complex 
non-Linear transformation. We will initially use simple examples {typically linear 
systems] to present concepts clearly, but the reader should keep In mind that much 
of what is said applies to more general systems 

Reference and feedforward signals: Note that there ate many equivalent 
ways to think about the command feedforward part of the control system, especially 
if the system components are linear (Horowitz 1 1963) , We can think of a reference 
signal to the feedback controller as another feedforward signal. For conceptual clar- 
ity, we want to separate modifiable feedforward actuator commands from reference 
signals to the feedback controller. The reference signals will always be the desired 

trajectory. 

Terminology: Although the control terminology is not completely standardized 
the term "feed Forward" is used quite often in the literature; for example, feedFor- 



19 



ward control for measured disturbances ( Grab be r Rarno, and WoQldridgej 1961; 
Predion, lOeSi Savas, lQCSi Weyrick, 197G; Marshall, 19T&; Owens, 1978; Anand, 
1984; Schwarzenbach and G3l! T 19S4; Leigh. 1985 ), feedforward contra] For com- 
mand inputs [ Ton, 1959; Webb, 1964; Eveleign, 1972 ), feedforward control for 
disturbances and commands ( Macmiilan, 1951; Bocbelin, 1962; Shins-key, 1965; 
Woolverton and MurriH, 1967), In the Russian literature the "Theory of Invari- 
ants" is often invoked to show how and why feedforward control should he used to 
make the system error independent of commands and disturbances. This has also 
been referred to as "PofttfUt'j principle 1 * [PremJnger and RoDtcnberg, 1964]. Com- 
mand feedforward Is often referred to as an input or reference prefilter, especially if 
this Signal is used as the reference to the feedback controller. Combined feedback 
and feedforward control has been referred to as a "two degree of freedom control 
system" (HorowUst, 1963). There is occasionally some confusion in the use of the 
term feedforward. "Feedforward controller 71 has been used to refer to any element 
in tbe forward path of the closed loop controller (Raven, 197S; Saridls, 19TTJ. These 
elements are clearly part of the feedback controller, 

2.2 The Feedback Controller 

Feedback control is the only ivay to handle unmeaRured or unpredictcd disturbances, 
modelling crrors 1 and to stabilize unstable plants, and thus plays a critical role in 
control in general and in robotics in particular. Much effort is going into robot 
feedback controller design, sometimes to tbe exclusion of considering other types of 
control. However, there are limits to what can be done with feedback control , and 
feedforward control can usefully augment a feedback controller. It should be noted 
that the feedback controller can be designed to a large extent independently of any 
feedforward control,, separating issues of robustness- to disturbances and modelling 
errors, and command tracking [Horowitz, 1963). 
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2*2*1 Limits on feedback controller 

Control of terminal compliance or, mote generally,, impedance has been proposed as 
a goal for robotic control. Such control may require limiting feedback gains, if the 
desired compliance is implemented as a low gain, position servo, Fur non-redundant 
robots Willi no terminal force/torque sensing ch nosing an impedance specifies the 
feedback controller completely. 

The use of a force sensors at the interface between the robot and its load or 
environment may allow differential rejection of modelling errors and external forces, 
In practice, however, this has not yet been shown to work well. 

In biological systems there are additional limits on the possible feedback Rams 
due to transmission, delays. Ln general,, non minimum phase elements such as delays 
(which also occur in machines) and right half plane SCTOS Bet limits Oil maximum 
feedback gains as do modelling error, ignorance of plant dynamics, parameter vari- 
ations,,, and measurement or command noise. (Astrom and Wittenmark, 1£S4) 

2,2.2 Plenty of work going on in adaptive feedback control 

Approaches to adaptive control have been almost exclusively focused on modifying 
the feedback controller (W. D, T. Davies, 1970; Tsypkin, 1971; Wittcnmark, lOTS; 

Saridii, 1977; Landau, 1070- Harris and Billings, 1981; Jsermann, 1982; Astroni. 
1955; Landau j Tomkuka, and Auslander, 1933; Astrom and Wittenmark, 1<JM; 
Goodwin and Sin, IMA; Voronm- p.nd Rutkovsky, 1984; Yale, 1955). 

There Las also been Rreat interest in adaptive feedback control in robotics, The 
added complexity of an adaptive controller is typically justified on the basis of the 
complexity of the robot dynamics, the belief that the parameters in robot models are 
difficult to estimate, that loads will be unknown, and that there will be significant 
modelling errors caused by friction [static antl dynamic), joint compliance, actuator 
modelling errors, and link flexibility. Many of the justifications for adaptive con- 
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trol are not valid, and it is not clear that adaptive feedback controllers have any 
advantage over well designed non-adaptive coiL'.rollvfa, Furthermore! the limitations 
on feedback control discussed previously limit the performance of adaptive feedback 
controllers just as they limit fixed gain controller pcrrorman.ee. 

2,3 Feedforward Control 

Feedforward control is any control action that is independent of the previous actual 
state history of the plant, and is used to improve command following and counteract 
predictable or measurable disturbances^ Feedforward control lias been used for a 
long time (Graham, llMo; Moore, 1951; Preminger and Rootenberg, iG63) T and in 
fact many control systems use feedforward control exclusively and thus are pure 
oucn loop controllers, 

Aenmiiaci! applications: Feedforward control is used extensively in aerospace 
applications and is related to the use of optimal control there, Typically a rocket 
trajectory is planned according to some cost criterion and the thrust history neces- 
sarily to follow that trajectory is pre-calculated. The feedback controller counteracts 
deviations from the planned trajectory by modifying the nominal thrusts. Adaptive 
feedforward control has not been emphasized in these applications as 1 he ^mn mckc'. 
is rarely used twice, 

Process control applications: Feedforward control has also been used in pro- 
cess control systems (Macmillan, 1&51; Shinskey, 19C3; Savas, 19®5\ Weyrick, 197&; 
Marshall, 19ft*; Shinskey, 1979), These applications tend to focus on counteracting 
measured disturbances since desired outputs tend to be constant for long periods of 
time and outputs during transients or changes in production are usually discarded, 
Also> process control is characterised uy Jong delays, large time constants (slow 
p:fi cesses), and measurable disturbances. 
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Robotics applications I Feed Forward control is an import A.nt part of robot 
control (Orin et al. t 197G; LieReois, Fonmier, and Aidon, ]98Q; P&u] s 1981; Brady 
et at,, 19&2; Asada f Kanade, and Takeyama, 1983; GerHtcnber^cr, lftfls- Hollars and 
Camion, ltlSJ} Licgcoia. LOSS), although there ia edihe variation in the terminology: 
"Inverse problem" (Paul, 19^2); "Computed torque™ {Markiewics^ L973; Bejczy, 
1974); "Two Stage 1 * control (Vukobratovit and Totkonjak, 19&2; Vukobratovid and 
Stokrt, 1&82). 

There are several reasons for the emphasis on feedforward control lit robotics: 
j) Often the task or movement trajectory is specified in advance^ and 2) the forces 
necessary to drive the robot along the specified trajectory are usually dominated by 
the forces necessary to drive the robot itself, and not the toad s and thus are to a 
large extent predictable. 3) Most robots must interact compliantly (i.e. with low 
stiffness) with the environment and thus feedback gains are limited. 4} OfLen the 
task Of trajectory is repetitive and the disturbances are the same on each execu- 
tion, fi) Computers are typically available for the generation of complex feedforward 
commands. The goal of robot trajectory control is to iiriiuntize deviations from an 
arbitrary desired trajectory, which with a limited gain feedback controller requires 
accurate models for the generation of feedforward commands and the ability toreFine 
those feedforward commands on repeated executions of the same trajectory. 

Biological applications; Biologists have used the concept of feedforward con- 
trol to understand commands based on measured signals and commands based on 
predicted forces (Szentagoihai and Arbib, 1975). The vestibular ocular reflex Is 
ail example of both disturbance feedforward control and adaptive disturbance feed- 
forward control. Fast limb movement is hypothesized to be driven by command 
feedforward control, as. the delays in feedback control in biological systems limit 
possible gains and feedback effectiveness. 

Technological limitations: To a certain extent the use of feedforward control 
has been limited in the past by the available technology (Peschon f 1965; Savas, 

23 



1965; Xsermann, 1981; Isermann s 19&2; Astrom and Wittenmark, 1934). The cost of 
analog electrical, mechanical h pneumatic, fluidie, or hydraulic controller circuits is 
proportional to their complexity, and implementing derivative, operator, nonlinear 
operations, time. delays, or functions such as sin(0) is relatively difficult. These 
circuits are restricted to be causal processors, and thua derivatives oF the command 
must cither be provided to the control le.r or approxi:iLL.u: !"A:. InturcLilLjLe mechanical 
differentiator is presented in Docbelin, 1962, p,28G)- These derivative? are typically 
useful in feedforward COntroL Furthermore, the reliability of analog circuitry is 
inversely proportional to its complexity, and this has encouraged the use of simple 
controllers. Some mechanism for information storage and retrieval is useful For 
feedforward control „ as wt shall later demonstrate, and in the past this was restricted 
to such technology as mechanical cam following, The introduction of computer 
control and the dramatic drop in the cast of computer hardware has greatly expanded 
the range of possible controller designs. 

2.3*1 Feedforward Controller Design 

It is well known that the ideal feedforward controller incorporates a model of the in- 
verse of the plant (Graham, 1*46; Moore, 1951; Tou^ 105*; Shinskey, L963; Evcleigh, 
1972; Wcyrick, 1975; Owens, 197S; Shinskey, 1979; Isermann, 1981; Isermann, 19S2i 
Aatromand Wittenmark, 19&-1), One implication of this is that adaptive feedforward 
contra] involves building better models of the plant. 

The need to invert the plant immediately raises the question of what to do when 
the inverse does not exist or is not realizable. This occurs when the plant has any 
of the following features: 

1. Time delays: The inverse of a time delay is a time advance, which is not 
realizable, 

2. Right half plane zeros: A causal inverse of a right half plane zero is unstable, 
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and a stable inverse: is not realizable Since it requires control actions in the 
infinite past, 

3. Many to one mappings: A plant may map many sets, of inputs into the 
same outputs, and thus formally the plant inverse does not exist. This is not 
typically a real problem, as additional constraints tan be added to resolve the 
many to one mapping, or a particular inverse can be chosen. 

4, Singularities: p j n i s or BC t B of points at which a nonlinear or lime varying 
linear plant mapping changes dimension may complicate inverting the plant. 
We will rely on the planning stage to avoid Ibis problem, at least for now. 

We see that realizable ly of the plant inverse is a critical issue in feedforward 
conmiand design. This leads us to distinguish disturbance feedforward and command 
feedforward. 

L Future commands are Jtnowm A disturbance is assumed to be unpre- 
dictable, or at least to have an unpredictable component, while we assume that 
commands are known as far into the future as necessary-- Of course, the fu- 
ture command may change, and this must be accommodated by the command 
feedforward controller, Some amount of acausality in command feedforward 
generation is acceptable, The disturbance feedforward controller, however, 
must be a causal processor. 

2. Commands have negligible noise: Tn addition, disturbance measurements 
may be corrupted by noise, while commands, being interna) to the controller, 
are assumed to be have negligible noise. Thus, derivative operators can accu- 
rately take derivatives of the command. 

The goal of disturbance feedforward controller design is to design a causal and 
stable system whose input is disturbance measurements and whose output is ap- 
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propria!*; control actions, Approaches to designing such a system are discussed in 
(Isermann, 1961), 

It is possible to treat command feedforward control as a type of disturbance 
feedforward control and attempt to design a causal command filter. This is not 
necessary the best approach, however. The goal of command feedforward control 13 
to design, the particular actuator command ssgrial for a particular desired trajectory. 
This processing need not be causa] . To handle a plant delay it is possible simply to 
shift the command backwards in time, To handle a right half plane zero it is possible 
to design conin'uiilds that do not grow to Infinity. This typically requires that the 
command starta in the infinite past, Since this is clearly not possible this type of 
command must be truncated „ leading to unavoidable trajectory following errors. 

2.4 Previous Approaches To Adaptive Feedforward 
Control 

2,4-1 Self Tuning Adaptive Feedforward Control 

A straightforward approach :o adaptive feedforward control is. to treat command 
and disturbance feedforward control identically and simply automate existing feed- 
forward controller design procedures. Initially the plant and any disturbance effects 
must be identified, and then a controller is designed from that. There are typi- 
cally two steps lo identification; choosing a model structure and then estimating the 
free parameters in that structure. By making different choices in mode] structures, 
parameter estimation procedures! and feedforward controller design many different 
adaptive feedforward control schemes can be derived. This approach Is analogous io 
"Self Tuning 1 * adaptive feedback control (Isermann, 19B1; Jsermann h 1982; Astrom, 
10S3; Astrom. and Wittenmark, 19&4], The perils of right half plane zeros are typi- 
cally avoided by using a model structure that cannot contain right half plane zeros. 
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This may lead to an approximation of the true plant 1 but the inverse of the model 
wt]l always be stable, 

Woolvertcm and Murrill (1967) suggest choosing initial parameters for a second 
Older feedforward Controller from experiments on the controlled process and then 
a trial and error exploration of the nearby parameter space to Eind the optimum 
parameter set. 

Bristol (1067) chooses a simple linear model structure for a heat exchanger s 
V" = a ' L 4- &, and closes an integrating feedback loop around the free parameters a 
and b to continuously update this modeL The model is concurrently used to Reneratfi 
feedforward commands, V n from load measurements, Lr An objective of thia scheme 
is to only include operations easily imploncntable in analog hardware and the final 
controller Was pneumatic. 

Bernard and LefkoVi'itZ (1962) clioae a more complicated process model 

KeT Dt 

where P(e) is the process transfer function, K is the gain, D is the delay, and L 
is a time constant. Initially K n T>, and L are estimated from previously collected 
data, and then D and L ate continuously updated. Tins model is used to generate 
bang-hang control responses to step changes in measured disturbanceSr 

Wittcnrnark [1073] derives a self-tuning minimum variance feedforward con- 
troller, 

Schumann and Christ [1979) present the most extensive exploration of self tuning 
adaptive feedforward controllers. They make explicit three steps: 

1. Choose structural parameters of process and disturbance models' model order, 
time delays, sampling interval, etc. 

2. Use some recursive parameter estimation scheme to identify the model param- 
eters Typically recursive 3-cl.sL squares :a used although if there is aL^ tlj Lie llil! 
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process noise an unbiased scheme such as recursive maximum likelihood is 
more appropriate. 

3. Design a feedforward controller based on the identified model. Here we assume 
the certainty equivalence principle is valid. 

The process and disturbances ana modeled by separate A.RMA models. Since their 
model structure allowed right half plane zeros they needed feedforward design pro- 
cedures that handled this problem, They explored Beven feEdforward control design 
procedures, 

Elkabe and Mcira (L&53J extend the adaptive model following control approach 
(Landau, 19T9) to adapt a feedforward controller to handle measurable disturbances. 

Widraw and colleagues [Widrow, Wal&ch., and Shaffer, 19S3; Widrow and Walach, 
1983; W id row et al, p 19S1; Anderson and Johnstone, 1981, Widrow h McCool, and 
Medoff, 1&79) model the plant as an all pole process, thus avoiding the right half 
plane zero problem. They use an LMS (least mean square} algorithm to continuously 
model the plant. They approximate the true plaflt in a least squares. seflJC, and are 
able to drive the plant approximately alonR a trajectory after a certain delay, 

Adaptive feedforward control in robotics 

2.4.2 Adaptive Global Models 

Oill- approach to adaptive command feedforward control is to attempt to identify a 
model of the robot dynamic*, and invert the model, or to directly identify an inverse 
model of the plant. In this section we will outline approaches which attempt to 
model the robot dynamics under all conditions with one modeL We refer to this 

approach as the global modelling approach. 

One approach for such global modeling is to derive a rigid body dynamics model 
of the arm and identify the parameter of that model. Examples of such work are 
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discussed in Chapter A and include Mayeda, Os-uka, and Kangawa { 198-4) T Mukerjee 
(1981}, Mukerj-ee and Ballard (19S&), Olsen and Bekey (1985), Neuman and Jihosla 
(19B&}, Ati h Atta&Oil, &nd Hollerbach (IQ&5), Khosla and Kaiiade (19*15). 

Goor (1985a,, 1985b) emphasized the actuator dynamics and claimed a decoupled 
third order model or each joint of the robot was more appropriate. He incorporated 
a third Older command feed Forward filter and developed an algorithm that esti- 
mated the coefficients of that filter in real time. He also emphasized the necessity 
of commanded trajectories to be executable, i.e. not require infinite inputs. 

A key issue in adaptive global modeling is that the identified model should be 
constant or at least independent of the state of the plant, even during large motions 
of the robot. If this is not the case then it is not clear that there is enough data, 
during the movement of the robot to identify a new model at each significantly 
different state. 

Another key issue is to injure that the data used to estimate the robot model 
cover a wide range of states and inputs. Otherwise structural modelling errors 
may drive the parameter estimation procedure to seriously distort the estimated 
parameters to fit a small range of data, This can cause predictions for other ranges 
of data not used in the estimation to deteriorate, and the advantage of global mnrleh, 
generalization, is lost. 

2.4.3 Lookup Table Based Approaches: Local models 

Lookup table based approaches have been suggested in the past as a solution to the 
model evaluation problem; given an existing model can we compute the necessary 
inputs to achieve the desired Outputs fas-L enough? [A3 bus t 1975a,, 1975 bj ftaibert, 
19TT, 19fr8; Raibert and Horn,. 1978) One way to fill the tables is using some form oF 
learning. Tables do provide an easily modifiable representation of a transformation. 
However, a Straightforward implementation leads to huge table sizeEj and consider - 
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able effort nm^ be expended in designing how data is actually stored^ and how It is 
put in and taken out of the table. A major Daw of tabular approaches is that it ia 
difficult to take advantage of the structure of the arm dynamics, so much mare data 
has to be collected to learn to the same level of performance as the algorithms de- 
scribed in this thesis, and generalizations between dissimilar movements are difficult 
to make. 

We will eventually be suggesting table based approaches to form a middle level 
of models: local models. Global models help us achieve generalization, and single 
trajectory models allow us to follow a single trajectory with aero error after practice. 
Lookup table baaed local models can help us modify commands for a particular 
trajectory to achieve similar trajectories. An important distinction between our 
approach and previous approaches is that we propose using a hierarchy of several 
different types- of models to gain the benefits, of each and the drawbacks of none. 

3.4,4 Iterative Trajectory Learning: Single trajectory mod- 
els. 

At the Opposite extreme of global modeling is the modeling of the feedforward com- 
mand for a single trajectory. Through repeated attempts to follow a single specified 
trajectory the feedforward command is refined, and ultimately the trajectory is fol- 
lowed TV 1th zero error. The key issues here are the stability and com'trgence rate of 
the iterative process, and how to design the learning operator. One drawback of this 
approach is that it only produces the appropriate command for a single trajectory- 
There is little guidance as to how to modify that command for similar trajectories. 
This, approach is discussed in Chapter 6, Single Trajectory Learning, 
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Chapter 3 

Rigid Body Load Inertial 
Parameter Estimation 



3,1 Abstract 

A method for estimating the mass, the center of masa, and the momenta of inertia of 
a rigid body load during general manipulator movement; is presented. The algorithm 
is derived from the Newton-Euler equations and. incorporates measurements of the 
force and torque from a wrist force-torque sensor and of the arm kinematics. The 
identification equations are linear in the desired unknown parameters, which are 
estimated by least squares. We have implemented this identification procedure on a 
FUMA 600 robot equipped with an RTI FS-B wrist force-torque sensor, and on the 
MIT Serial Link Direct Drive Arm equipped with a TSarry Wright Company Aatek 
wrist furce-torque sensor. Good estimates were obtained for load mass and center 
of ma53 v and the forces and torques due to movement of the Joad could he predicted 
accurately. The load moments of inertia were more difficult to estimate. 
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3.2 Introduction 

This chapter present* a method df estimating all of the incrtial parameters of a rigid 
body load using a wrist Farce-torque sensor: the mass, the moments of inertia, the 
location of its center of mass, and the objects orientation relative to a force sensing 
coordinate system. The procedure has three steps: 

I, A Newton-Eider formulation for the rigid body load yieldR dynamics equations 
linear in the unknown ineftial parameters, when the moment of inertia tensor 
is expressed about the wrist sensing force coordinate system origin, 

Z, These inertia! parameters are then estimated using a least squares estimation 
algorithm. 

3. The location of the load's center of mass, its orientation, and its principal 
moments of inertia can be recovered from the sensor referenced estimated 
parameters. 

In principle, there are no restrictions on the movements used to do this load identifi- 
catiou, except that if accurate esti [nation of all the parameters is desired the motion 
must he sufficiently rich (ie„ occupy more than one orientation wsLh respect to grav- 
ity and contain angular accelerations in several different directions). In practice, 
however, special teat movements, must sometimes he used to get accurate estimates 
of moment of inertia parameters. 

There are several applications for this load identification procedure. The accu- 
racy of path following atsd general control quality of manipulators moving external 
loads can he improved by incorporating a model of the load into the controller, as 
the effective inertia! parameters Dr the last link of the manipulator change with the 
load. The mass, the center of mass, and the moments of inertia constitute a com- 
plete set of inertia] parameters for an object; in most cases,, these parameters form a 
good description of the object, although they do not uniquely define it. The object 
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may be completely unknown at first mill an inertia! description of the object may be 
generated as the robot picks up and moves the object. The robot may also he used 
in a verification process! in which the desired specification of the object is known and 
the manipulator examines the object to verify if it is within the tolerances. Recog- 
nition, finding the beat match of a manipulated object to one among a set of known 
objects, may also be desired. Finally,, the estimated location, of the center of mass 
and the orientation of the principal axis can be used to verify tint the manipulator 
has grasped the object in the desired manner, 

A key feature of our approach LS that it requires no special test or identification 
movements and therefore can continuously interpret wrist force and torque sensory 
data during any desired manipulation. Previous methods of load identification were 
restricted In their application, Paul [19S1) described two methods of determining 
the mass of a load when the manipulator is at rest, one requiring the knowledge of 
joint torques and the other forces and torques at the wrist. The center of mass and 
the load moments of innrtia were not identified, 

CoifFet (198 3-) utilized joint torque sensing to estimate the mass and center of 
mass of a load for a robot at rest. Moments of inertia were estimated with special 
test motions, moving only one axis at a time or applying test torques. Because of 
the intervening link masae* and domination of inertia by the musa moments, joint 
torque Muring is less accurate than wrist force-torque sensing, 

Olsen and BeVey (1985) assumed full force-torque sensing at the wrist to identify 
the toad without special test motions. Mukerjee (1984), and Mukerjcc ii Ballard 
{1985) developed an, approach similar to ours, again allowing general motion during 
load identification. Nevertheless, neither paper simulated or experimentally imple- 
mented their procedures to Verify the correctness of the equations or to determine 
the accuracy of estimation in the presence of noise and imperfect measurements. 

Our algorithm requires measurements of the force and torque due to a load and 
measurements or estimates of the position, velocity, acceleration, orientation, an^u- 
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lar velocity-, and angular acceleration of the force sensing coordinate system. The 
Algorithm can handle incomplete farce and torque measurement by simply elimi- 
nating the equations containing; missing measurements. The necessary kinematic 
data can be obtained from the joint angles and, if availably the joint velocities of 
the manipulator, The inertia! parameters of a robot hand can be identified using 
this algorithm and then the predicted forces and torques due to the hand can be 
subtracted from the sensed forces and torques. 

This inertia] parameter estimation algorithm wag implemented u^ing a PUMA 
600 robot equipped with an RTI FS-B wri&t force-torque sensor, and on the MIT 
Serial Link Direct Drive Arm (DDA] equipped with a. Barry Wright Company Astek 
FS6-120A-2QQ 6-axis wrist force-torque sensor, 

3*3 The Newton-Euler Approach To The Load 
Identification Problem 

3.3.1 Deriving The Parameter Equation 

To derive equation* for estimating the unknown Inertlal parameters „ the coordinate 
systems in Figure 3.1 are used to relate different coordinate frames and vectors, O 
is an inertia! or base coordinate system! which is fixed in space with gravity painting 
along the —x axis, P is the force reference coordinate system of a wrist force-torque 
sensor rigidly attached to the load, Q represents the principal axis of the rigid body 
load located at the center of mass. The x axis of Q is along the largest principal 
moment of inertia, and the z axis along the smallest, Q is unique up to a reflection 
in bodies with 3 distinct principal momenta of inertia. In the derivation that follows 
a A vectors a™ initially expressed in the base coordinate system O, 

The mass, moments of inertia, Location of the center of mass, and orientation, of 
the body (a rotation gr R from the principal axes to the force reference system) arc 
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FORCE REFERENCE FRAME 

m 




OBJECT REFERENCE FRAME 



INERTIAL REFERENCE FRAME 



p: position vector from th* origin of the base coordinate frame 
to the origin of the wrist ransor coordinate frame, 

q? position vector from the origin of the base coordinate frame 
to the center of the mass of the load. 

c: position vector from the origin of the wrist SFr.F-or coordinate 
frame to the center of the mass of the load. 

Figure 3,1: Coordinate Frames. 
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related to the motion of the Load and the Fotceb and torques exerted on it hy the 
Newton-Euler equations. The net force t f and the net torque ,11 acting on the load 
at the center of mass are: 

Hf f = f^ms = mq (3.1J 

,n=a-cxf = ^i + w)((» (3,2) 

where: 

f = the farce exerted by the wrist sensor on the load at the 

point p f 
m = the mati of the load, 

g = the gravity vector (g = [0, 0, -9.8 metero/Bec*]), 
4 = the acceleration of the center of mass of the load, 
& = the torque exerted by the wrist sensor on the load at the 

point p, 
c = the unknown location of the center of mass relative to the 

force sensing wrist origin P f 
jl = the moment of inertia tensor about the center of maaa, 
u = the angular velocity vector, and 
w = the angular acceleration vector. 
We need to express the force and torque measured by the wrist aenaDr in terms of 
the product of known geometric parameters and the unknown inertia! parameters. 
Although the location of the center of mass and hence its acceleration q are unknown,, 
q is related to the the acceleration of the force reference frame p by 

q=p + wxc + w>i[ w xc) (3 [7,40] 3 } 

Substituting [3) into (3.1), 

f = mp - mg +wxmc + (*rx [tux mc) (3.4) 

a Equation aumken in brKlitt* nftr to equation* in Symtq, 19T1- 

3G 



Substituting (3.4) into (3,3), 

11 = f Iu + at x (g.rw) + tnc a (w x <■-) -\- hk x (w x [tit x c)) 

+ mc X p — mc x g (5) 

Although the terms CX (w X c) HJld C X (w X (w Xc}) are quadratic in the unknown 
location of the center of mass c, these quadratic terms are eliminated by expressing 
the moment of inertia tensor about the force sensor coordinate orijjin fplji instead of 
about the lc:jUt of mass { f I). Rewriting (&) as; 

ii = f I* + w x [ q lw) + n?((e r e)l - (cc r )]w 

+ w x (Rl[(c r e)l - (cc r );ttf) + me x p - mc x g (6} 



and using the three dimensional, version of the parallel AXL3 theorem 

„! = ,! 4- m[(e T e] !-(«*)] 



[7 [10.147]}. 



to simplify it results in: 



n = p Iw -+- Lt x (jluj -+■ tnc xp- mc x g 



(3.HJ 



[1 is the 3 dimensional identity matrix). We now express ftl! the vectors ill the 
wrist sensor coordinate system P> since then the quantities c and p I are constant. 
Moreover, the wrist forr.e- torque sensor measures forces and torques directly in the 
F coordinate frame. 

In order to Formulate the above equations as a ays-tern of linear equations, the 
following notation is used: 



W X C = 



- uj x w r 



u.', 



— --.■ 



—u a 

W, 






= [wxj C 
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where 

Ju Iki Ji j 
i - i /« J sa /» [8.11) 

Jig ijj. /jj 

Using these expressions, Eqs. (3.4] and (3.&) can be written as a single matrix 
equation in the wrist sensor coordinate frame: 
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P-S [wx]+ [wx][wx; 

o [(g-p)x] [*JjJ + [wk][H 



(3,12] 



or more compactly* 



w = A^ 



[3.13) 



where iv is ?. element ^tl'jllJl vuciLor combining both the force fun! torque vcc'.srs, 
A is a 6 x 10 matrix, and $ 5a the vector of the 10 unknown inertia] parameters. 
Note ttia.t the center of mass cannot be determined directly, but on]y aa the mass 
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moment frcc. But eince the mass m is separately determined , its contribution can 
be factored from the mase moment later, 

3,3*2 Estimating The Parameters 

The quantities inside the A matrix arc computed by direct kinematics computation 
(Lim, Walker,, and Paul, 1960b] from the measured joint angles. The elements of 
the W vccLorare measured directly by the Wrist force sensor. Since (3,13) represents 
G equations and 10 unknowns, at least two data points are necessary to solve for the 
# vector, Lc, the force and the position data sampled at two different configurations 
af the manipulator. For robust estimates in the presence of noise 1 we actually need 
to U£c a larger number of data points. Each data point addE 6 more equations, while 
the number of unknowns, thE elements of #, remain constant, w and A can be 
augmented with n data points:: 



A = 



AI| 



A\«) 



w = 



w[l] 



w[n| 



(3-14) 



where each A[i\ and w[i] are matrix and vector quantities described in (3,12). For- 
mulated this way, any linear estimation algorithm can be used to identify the «£ 
vector. A simple and popular method is the Least squares rrjelhodr The estimate for 
4 is given byi 

£=(A r A)-'A T w (SAB) 

Equation {3.15} can also he formulated in a recursive form [Ljung and Sodersfcrom, 
1983) for on-Eine estimation. 

3*3+3 Recovering Object And Grip Parameters 

The estimated inertia! parameters (it* p mc, pi) are adequate far control but For 
object recognition and verification we aiso require the principal momentB of inertia 
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Jii Jji Jji the location of the tenter of mass r, an A the? orientation fP R of Q with 
respect; to P. 

The parallel axis theorem is used to compute the inertia terms translated to the 
center of mas* of the load. 

,i = t !-*[(£ r s)i-(es T )] (3.i7) 

The principal moments are obtained by diagonalizing f I. 
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R r (3.18) 



This diagonal zation can always be achieved since (I IS symmetric, but when two or 
more principal moments are equal the rotation matrix, , ;1J ,R, is no longer unique. 

3,4 Experimental Results 

3.4*1 Estimation on the PUMA Robot 

The inertia] parameter estimation algorithm was originally implemented on a. PUMA 
600 robot equipped with an RTl FS-H wrist force-torque sensor [Figure 5-2), which 
measures all six forces and torques. The PUMA 600 has encoders, at each joint to 
measure joint angles, but no tachometers., Thus, to obtain the joint velocities and ac- 
celerationa k the joint angles are differentiated and doublc-diiz'LTuntialed, respectively, 
by a digital differentiating filter (Figure 3-3), The cutoff frequency of 33 Hz for the 
filter was determined empirically to produce the best results. Both the encoder data 
and the wrist sensor data were initially sampled at 1UUU Hz, It was later determined 
that a sampling rate of 200 Hi was sufficient, and the data were resampled at the 
Lower rate to reduce processing time. A least squares identification algorithm was 
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Figure 3,2; Puma with a test Load, 

implemented as an off-line computation, but an online implementation would have 
been Straightforward. 



Static Estimation Using The PUMA 

To test the calibration of the force sensor and the kinematics of the PUMA arm a 
static identification was performed. The forces and torques are now due only to the 
gravity acting on the- load t and equations (3,4) and (§.$) simplify to 

f = -mi (3.19) 

n = -*nc x % (3.2Q) 

Aa seen in {3.19) and (3.20), only tlte mass and the center of mass can he identified 
whEle the manipulator ts stationary. 

To avoid needing to determine the gripper geometric parameters, the center 
of masa ^sti mates are evaluated by the estimates of the changes in the center of 
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Figure 3,3: Measured angle 0, calculated angular velocity fi t and calculated angular 
acceleration & for joint 4. 
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P arametera 


Aetvai 

Values 


Static 
Estimates 


Dynamic 
Estimates 


Mass {kg} 
Change in £ v [m) \ 


1,100 

0,037 


1.103 
0.037 


1.067 
0,039 


Mass (kg) 
Change in c„(m) 


1.106 
-0,043 


hot 

-0.043 


1,054 
-0.042 


Mass (kg) 
Change in c^m.) 


1.106 
-0.021 


1.100 
-0.030 


1.073 
-0.021 


: Mass (kg) 
Change LJi t 9 {m) 


1.106 
0.0 L8 


1.099 
0.018 


1.074 
0.030 



Table 3.1: Mass and the center of mass estimates 

mass an the load is moved along the y-axte from the reference position by known 
amounts. The results of estimation are shown in the second column of Table 3,1 
for an aluminum block [2x 2 X 6ut.) with a mass of 1,100/f g. Only the changes In 
t B are shown in Table 3.1; the estimates of t x and e r remained within Itntn of the 
reference Values (e, = irnm and c £ — 47j7im). Each set of estimates were computed 
from 6 sets of data,, i.e. data taken at 6 different positions and orientations of the 
manipulator,, where each data point k averaged over 1DD0 samples to minimise the 
slfects- af noise. TIie results show thai in the static case the mass of the load can he 
estimated to within 10# of the actual mass. The center of mass can be estimated to 
within lmm of the actual values for thia load. 

Static load estimation only tests the force sensor calibration and the position 
measurement capabilities ■ui' the robot the sensor is mounted on. In order to assess 
the effects of the dynamic capabilities of the robot, on load estimation and to be able 
to estimate the momenta of inertia of the load we must assess parameter estimation 
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during general movement. 

Dynamic Estimation Using The PUMA 

In the dynamic case, the joint position Encoder and the wrist sensor data are sampled 
while the manipulator is in motion. A fifth order polynomial trajectory (a minimum- 
jerk time Function) in joint space was used to minimi se the mechanical vibrations at 
the beginning and the end of the movement, and to improve the signal to noise ratio 
(SNR) in the acceleration data (Figure 3.5). For more popular b&ng-coast-bang type 
trajectories, the joint accelerations are zero except at tlie beginning and the end of 
the movements, resulting in a poor signal to noise ratio in the acceleration data for 
most of the movement. 

We found that the PL" VTA robot lacked the acceleration capacity necessary to 
estimate the momenta of inertia of the load, It also lacked true velocity sensors 
at the joints p which made estimation of the acceleration of the load difficutt. The 
dynamic estimates of mass and center of mass for the previous, load are shown in 
the last column of Table 3.1, The data used in these estimates were sampled while 
the manipulator was moving from (Q N 0, 0, -90, 0, 0] to (90, -60, 90, 90, 90, 90^ degrees 
on a straight line in joint space in 2 seconds. Joint 4 of the PUMA has a higher 
maximum acceleration than the other joints, and thus „ a longer path was given for it. 
Thia movement was the fattest the PUMA can execute using the fifth order trajec- 
tory without reaching the maximum acceleration for any of its joints, The estimates 
used all 400 data paints sampled during the 2 Fecond movement. The results show 
slight deterioration in these estimates when compared to the static estimates; but 
they are still within 40$ and 2mm of the actual mass and displacement,, respectively. 
However, the signal to noise ratios in the acceleration and the force-torque data were 
too low for accurate estimates of the moments of inertia for this load [0.00238 Kg -m* 
in the largest principal moment), In this case h the torque due to gravity is approxi- 
mately 40 times greater than the torque due to the maximum angular acceleration 
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Parameters 


.dciwaJ 


PUMA 1 


PUMA 1 


DDA ] 


i*9 m S ) 


Verities 


Kfifl17l£lfrfl 


Estimates 


.Efltimafea 


hi 


0,0244 


0.01G2 


0.0240 


o.cmn 


I» 





-0,0048 


Q.QOOG 


0.0006 


I» 





0*0019 


0.0008 


o.oons 


In 


0,0007 


0,0021 


0,003C 


-0.0002 


I» 





-0.0016 


-0.0004 


-0.0002 


^M 


0.0242 


0.01 T6 


0.0199 


0,0241 



Table 3,2: Actual and estimated moments or inertia, either for all joints moving 1, or 

special test motions 5 , 

of the load. Thng, ■even slight noise in the data would result in poor estimates of I. 
Therefore, experiments with a larger rotational load were performed for the es- 
timates dF the moments of inertia. The new experimental load is shown in Figure 
3.2. This load haa large masses at the two ends of the aluminum bar s resulting in 
large moments of inertia in two directions (— Q.[)24kg ■ m'} and a small moment in 
the other. A typical set of estimates of the moments of inertia at the center of mass 
frame for the load with the grippeT subtracted out are shown in Table J r 2 for the 
above all-joillts-inuving trajectory. They contain some errors but are fairly close to 
the actual values. 

Special Test Movements Using The- PUMA 

In order to improve the estimate!- by maximizing the rotational accelerations in the 
trajectories, a series of special test movements were generated. The data was sarnpEed 
while the robot was following three separate 2-second rotational trajectories around 
the principal ax.es of the load. Such trajectories used joints 4 and 6 only, and resulted 
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In higher acceleration data than the previous, trajectory, thus improving the signal to 
noise ratio in both the acceleration and the force-torque data. Typical estimates for 
these special movements show improvements over the estimates with the previous 
trajectory (Ta.ble 3.2), Although the estimate of / It is slightly worse than bEfore, all 
the other terms have improved; the cross terms, especially, aw much smaller than 
before. However, these estimates oF I are not as accurate as the estimates of the 
mass and the center of mage shown in Table 3.1. 

Figure 3.4 shows the comparison of the measured forces and torques, and the 
computed forces and torques from the estimated parameters and the measured joint 
data using the simu-ator for tins original trajectory. The two gets of figures match 
very well even in the mechanical vibrations, verifying qualitatively the accuracy 
of ths estimates. This suggests that for control purposes even poor estimation of 
moment of inertia parameters will allow good estimates of the total force and torque 
necessary to achieve a trajectory; This mates good sense in that the load forces with 
the PUMA are dominated by gravitational components, and angular accelerations 
experienced by the load are small relative to those components. 

The effect of the errors causing poor estimates of moment of inertia parameters 
could be alleviated by increasing the angular acceleration experienced by the load. 
Since we had reached the sustained acceleration capacity of even an unloaded PUMA 
robot, we decided to explore this issue using the MIT Serial Link Direct Drive 
Arm This robot can achieve higher sustained" accelerations than the PUMA and 
in addition is aLeo equipped with tachometers at the joints, making estimation of 
acceleration much easier, 

3.4.2 The MIT Serial Link Direct Drive Arm 

The mertial parameter estimation algorithm was next implemented on the MIT 
Serial Link Direct Drive Arm (DDA), equipped with a Barry Wright Company 
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Figure 3-4: Measured fbrce^torque: <Jita and computed mrce-torque Jala fiom the 
eatiinatfia using the PUMA, 
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A seek 7S6-Y10A-2QQ 6-axis force-torque sensor which again measures all three forces 
and three torques about a point. The DDA is described in An K Atkeson, HoSlerbaeh 
(lflflS},, and is capable of higher tip velocities and accelerations than the PUMA. The 
DDA has tachometers on each of its three joints so that numerical differentiation of 
positions 3* unnecessary, but we still hud to digitally differentiate the velocities to 
find the accelerations uiirtg & cutoff frEquency of iOHz. The positions and velocities 
of the robot were initially sampled at LkHs but were later down-sampled to much 
the sampling, frequency of the force-torque sensor of 240 Hz. The identification 
procedure was again implemented off-liner 

Dynamic Estimation Using The Direct Drive Arm 

The data used for estimating the inertia] parameters of the load were sampled while 
the manipulator was moving from (280 h 2-&9J,— 30) to [80, 19.1 s 220} in one second. 
Again a. fifth order polynomial straight line trajectory in joint space was used, The 
resulting estimates, for the moment of Inertia parameters are shown in the last column 
of Table 3.2, The estimates for the mass and the location of the center of mass were 
as good as the PUMA results and are not shown. We see that the moment of inertia 
parameters estimated are on the whole better than the PUMA results. 

Parameters estimated for a set of special teit movements using the Direct Drive 
Arm were not substantially different. Our special test movements for the DDA wl:l- 
not substantially faster than the movement of all joints, and thus probably contained 
the same amount of information. 

Finally, Figure 3.S shows the comparison of typical measured forces and torques 
with computed forces and torques from the estimated parameters and the measured 
joint data using the simulator for the original trajectory, Once again we have a 
very good match between the measured And predicted forces and torques. Thus we 
nee that the combination of higher angular accelerations and good velocity sensing 
results in better parameter estimates, as hoped. 

48 





F lru re 3.5: Measured force-torque data and computed forte-torque da La fram the 
estimates using the Direct Drive Arm. 
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3.5 Discussion 

3,5.1 Did The Algorithm Work? 

Thi* chapter describes ail attempt ta characterize the usefulness of wrist force-torque 
sensing for estimating the; inertlal parameters of rigid body loads for control and 
recognition /verification/grasping. Out conclusion is that prediction of force* for 
control can be good and seems to work well ir. out implementations. Identifying 
parameters well enough for recognition of the object may require large accelerations 
dt special test movements in order ta accurately identify the moment of inertia 
parameters . 

It is Imp o: tan: to realize that there are two distinct uses of an identified model. 
For control whit mattsrs is. matching the input-output behavior of the model (in 
this ease the relationship of Eoad trajectory to load forces and torques) to reality 
white for recognition/ verification what matter? 14 matching estimated parameters to 
a set of parameters: postulated for reality. We £nd that both of our Implementations, 
of load inertia! parameter estimation successfully match the input output behavior 
of the load (see Figures 3.4 and 3.5), although we have not yet used this information 
in a control scheme. However, we find that the limited acceleration capacity of the 
PUMA robot and Its limited sensing do not permit us ta e&timate the moments of 
inertia of the load accurately without the use of special test motion*. In all cases 
the mass and the location of the center of masa could be accurately estimated from 
both a series of static measurements, and measurements taken during movements. 

This work Is preliminary in that an adequate statistical characterization of the 
errors of the estimated parameters or the predicted forces has not been attempted. 
NeVertheteSS b we have gained insight into the sources of Such errors. 
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3 + 5 + 2 Sources of Error 

The ultimate source of error is the random noise inherent in the; sensing process 
itself. The noise levels on the position, and velocity sensing are probably negligible, 
and could be further reduced by appropriate filtering using a model ba&ed filkir such 
as the Extended Kalman Filter (Gelb N 1974], The forte and torque measurement 
process involve measuring the strains of structure members in the SCfiSOr with strain 
gages. The random noise involved in such measurements is also probably negligible. 

Bias Errors 

However strain gages are notoriously prone to drift. We feci that periodic recali- 
bration of the offsets (very often] and the strain to force calibration matrix (often] 
may be necessary to reduce load parameter estimation errors further. Before using 
the force. sensors we allowed the system to warm up and we recalibrated the offsets 
before each data collection session and checked for a change in the calibrated offsets 
afterward. 

Uruuodelled Dynamics 

A further source of noise is. unmodelled structural dynamics. Neither the robot 
tints nor the load itself are perfectly rigid bodies, A greater source of concern is 
the compliance of the force sensor itself. In order to generate structural Strains 
large enough Lo be reliably measured with strain gages, a good deal of compliance 
is introduced into the force sensor, The load rijndly attached to the force sensor 
becomes a relatively undamped Spring mass system, The- response of the Astek force 
sensor to a tap on an attached load is shown in the "'undamped impulse response" 
record, of Figure 3,6. 

The effect of robot movement on this spring mass system is shown in the "un- 
damped movement response* record. 
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Figure 3.6; Vibrations at load on fores aenBDr. 
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There are several approaches to deal with this prohlem of unmodeUed dynamics. 
One approach is to attempt to identify the Additional dynamics. This greatly in- 
creases the complexity of the identification process, and ijreatly increases the amount 
of data that needs to be collected to get reliable estimates of any parameter. We 
feel Euch an approach should only be taken as a last resort. 

Another approach is to try to avoid exciting the unmodeiled dynamics by choos- 
ing robot trajectories that are as smooth as possible. This is one mason why we 
chose 5th order polynomial trajectories, so that we could maintain continuity of 
velocities and accelerations, U*i::.y higher order polynomials would result in even 
greater smoothness. However, *ve found that with the PUMA a smooth commanded 
trajectory did not result in a smooth actual trajectory., in that the control methods 
used and the: actual hardware of the robot still introduced substantial vibration. 
One way to te]] if the PUMA is turned on is to touch it and feel if it Is vibrating, 
Vibrations were less of a problem with the Direct Drive Arm, although still present. 

The most successful approach is to mechanically damp out the vibrations by 
inlrod-.:cinR some form of energy dissipation into che structure, We added hard 
rubber washers between the force sensor and the load, and the "damped impulse 
response* oT Figure 3.6 illustrates the response of the force sensor to a tap an the load. 
We see that the oscillations decay much faster-. The "damped movement response* 
indicates that this mechanical damping greatly reduces the effect of movement or 
the resonant modes oF the force senior plus load. The conclusion we draw is that 
appropriate damping should be built into force sensors, just as accclerometers are 
filled with oil to provide a critically damped response for a specified measurement 
bandwidth. Failing that, energy dissipation must be introduced either into the 
structural components of the robot or into the gripper either structurally or as a 
viscous sic in. Appropriate mechanical damping may also be useful when using; a 
force sensor in closed loop force controL 
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Optimal Filtering 



The need to numerically differentiate the velocity to find the aceeleattan, or worse 

yet, double differentiate position!, great Iv amplifies whatever noise is present. One 

can avoid the need to explicitly calculate accelerations by symbolically integrating 

eqijitjpnH [HA] and (3,&), One approach to integrating the equations is presented 5n 

the appendix's and we can express the resulLmg estimation equations in matrix form 

as: 

m 

mC a 

mc, 

hi 

hi 

hi 

ht 

ha 



ft+T 

I. ' 

ft+T 



ft+T 

L A4r \ 



[3.21) 



where the drat row of [j ( [+T A<Jr] is 

t+T r|+T / ft+T \ 

+ j uxpdr-N R<frrg 

!7 (+t\ "I j-f+T 
I y I x +■ I |wx][ux|dr 



{22J 



and the- second row ia 



[(-*T " jf*" * * * + (jf +Tr Jr ) '«) "] 

[■H1H 



[wx]'»w[ dr 



[23] 
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However, we feel that an effort to characterise the various, noise sources and 

iin attempt to "optimally™ filter and differentiate/integrate the data will result in 
better estimates. If there is substantial low frequency noise or bias In the data the 

integration will amplify that noise relative to the signal frequencies of the data. 
What Is needed is a problem formulation that gives us guidance as to how to design 
data processing filters for estimation and how to handle the need to differentiate 
or integrate some of the data to supply missing measurements. We are presently 
in veRti Rating such an approach, 

3*5+3 Kinematic Errors 

Part of the error may be due to inaccuracies in the current kinematic parameters of 
the manipulator. Experiments have shown that the actual orientation of the robot 
can be up to 4* off from the orientation computed from the encoder data. 

3.5.4 Why Estimating Mdment of Inertia Parameters is Hard 

One of the factors that makes it difficult to accurately identify momenta of inertia is 
the typically large contribution of gravitational torque, which depends only on the 
mass and the relative- location of the center of mass to the force sensing coordinate 
origin. A point mass rotated at a radius of 5cm from a horizontal ajtis must complete 
a full 360" rotation in 425 milliseconds Far the torque due to angular acceleration to 
be equal to the gravitational torque, if a 5th order polynomial trajectory is used. A 
way to avoid gravitational torques is to rotate the load about ft vertical axis, or to 
have the point of force-torque sensing close to the center of mass. 

A simple example will illustrate the difficulty of recovering principle moments 
of inertia;, given the- moment of inertia tensor about the force sensing origin. The 
principle moment of inertia oF a iLiiL~onn sphere is only 2/7 of the tola] moment of 
inertia when it is rotated about an axis tangent to its surface, so that the effects of 
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any errors in estimating the niaas, the location of the center of mau, and the grit 
moments of inertia are amplified when the principle moment of inertia is calculated. 
This problem can be reduced by having th* point of force ceasing as close to the 
center of mass as possible. 

It still may be difficult to find the orientation of the principle momenta of inertia 
even when the moment ofinertia tensor about the center of mass has been estimated 
fairly accurately, This occurs when two or more principle momenta of inertia are 
approximately equal. Finding the orientation of the principle axis is equivalent to 
diagonal sing a symmetric matrix, which becomes ill-conditioned when some of the 
eigenvalues are almost equal. A twfo dimensional exampli illustrates the problem: 
Consider the diagonaEiied matrix 



co#(0) - iin(0) 
sin(fl) cob(0) 



Ai 

JU 



cca(0] «in(f) 
- sin(#) cos(S) 



[3.24) 



(3,25) 



with eigenvalues {A ln A 3 } and whose first principle axis is oriented at an angle $ with 
respect to the s axis:. By substituting X l -X i = t Into the matrix [3.24) , 

A 2 + i cos 1 [0] e cos(0) sin(0) 
f coa(0} sin(0) X } r e sin 1 (6 } 

we see that when the two eigenvalues are almost equa^ the terms of the matrix de- 
pendent on the angle, 0, become very Email. AH terms that contain angle information 
are multiplied by the difference of the principle moments of inertia,, e> With a fixed 
amount of noise in each of the entries of the identified moment of inertia matrix, the 
orientation of the principle axis 1 S„ will become more and more difficult to recover as 
the prineipls moments of inertia approach equality and therafore e approaches isro. 
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3.6 Open Questions 

3*6*1 Data Proteasing Issues 

Many of the issues discussed below wi|] be addressed in conduction with the contin- 
uation of optima] filtering research described in Chapter 5. 

Statistical Characterization of Errors 

What is the variance of our force and torque predictions? Docs this EeveE of error 

correspond to the variability af Farce and torque measurements? What is the variance 
of the estimated parameters „ for a given amount of data? What measurement noise 
does this correspond to? How close are the estimated measurement noise to the 
specifications for the force sensor? Linear least squares estimation is based on a 
particular statistical model which includes formulas for estimated parameter variance 
and predictor variances. However, it is not clear that the assumed statistical model 
is valid in this application. 

Most Efficient Movement 

Find the most efficient movement (or just a good movement) tD improve the load 
parameter estimates given what is already known about the load and which estimates 
are to be improved. This is part of a general problem in quantifying the "richness" of 
the input. The statistical model assumed for linear least squares estimation provides 
a good starting point for answering this question, 

Quantify desirable sensor properties 

What is the stiffness of the wrist sensor? The Aztec wrist force-torque sensor has 
an interna] samping rate of 4SOHi„ 3 a ]20Ui r analog low-pass filter, and a 120 Hz 
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digital loiv^pasB Biter also h for an output sampling rate of 210Hz, What at* more 
desirable sensor uttering; properties? 

Role of Velocity Sensing 

We data that tachometers helped with the Asada Arm, We could repeat the load 
identification without using the tachometer measurements, and see how much dif- 
ference the tachometers made. 

Wrist sensing vs, joint sensing 

We could compare the load estimation with a wrist sensor with load estimation, using 
joint sensors, and see if there is a substantia] diEferencs in accuracy, 

Separate Control and Recognition Modules 

Since the goale of identification for control [predict input/output behavior] and 
recognition (get good object parameter CHlLmale&) are different there should probably 
be different modules with different filtering and data handling properties. 

Effect of kinematic errors 

How robust is the load parameter estimation procedure to kinematic errors caused 
by joint measurement error and unmeasured link deflection? 

3,6,2 A Complete Load Handling System 

Use rigid body load inertial parameter estimation as a single module in a more 
complete load identification system. Handle other model structures: What are useful 
structures? What are useful parameter estimation procedures? Provide a test or 
measure for verifying that a load is actually a rigid body. What does this mean? 
Mare a rigid body than anything else? 
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Note that the prediction errors (residuals) of the estimated modet provide a 
natural measure for whctlier the body m rigid or not. 

3,6.3 Real-Time Implementation. 

Implement load parameter estimation as on-line estimation procedure. It is appro- 
priate to address here the use of apriori load estimates. 



3,7 Appendix: Integrating The Forces And Torques 
In The Force Sensor Frame 

In this appendix wc will show how to integrate the load identification equations (3,4} 
and (3,8), Throughout this derivation, we will use the superscript notation " and F 
to indicate what coordinate frame a vectoT is expressed in„ if there is any confusion. 
If there is IIO SUth superscript; tile vector is expressed in the load framft. 
The term f> is integrated using equation 7,22 of (Symon, 197 1): 



R ^™-s< a "« ,E '"'" f °* , > 



(3.26) 



were R is the rotation matrix representing the rotation from an inertial MftTdinate 
system to the force sensing coordinate system that continuously moves with the load, 
To get an integral form of this equation we simply integrate it: 

r*+T t+T ft+T 



j R ■ "p dr - (R ■ *p] + J R( a w X °p) At 



and performing the indicated rotations leaves us with 



/-1 + T t + T rt + T 

j F pU7 k }) | / ^QlX^pdV 

h i Ji 



Similarly 



j-t+T *+T ft+r H-T 

/ 9 utdr= f iii + { F w x f udr = p w 



(3.27) 



(3.28) 



(S.29) 



since w x w = 0, We also remember that 



ft+T / fl+T \ 



(3.30) 






Each of the matrices f wx, and [*w] can be integrated element by element to 
show J .r.at 

{/*****) x] - [^|^] x] (3,31) 

fV«l * = [• [f**** Jf )] = [. (^[ + ^ ] (3,32) 

The matrix |(g- p)x] can tie integrated element by element in the same way. Eajth 

matrix element of the terms [*mk][*wx] and [Pwx]>Pw] are numerically integrated 
by adding values at catSi time step. The result of this integration [a given in Equa- 
tion (3,51), 
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Chapter 4 

Manipulator Link Inertial 
Parameter Estimation 



4*1 Abstract 

A method of estimating the mass,, the location of center of mas, and the momenta 
of inertia of each, rigid body link of a. robot during general manipulator movement 
ia presented. The algorithm is derived from the Newton-Euier equations, and uses 
meaaur emeiite of the joint torques as well as the measurement and calculation of the 
kinematics of the manipulator while it is moving. The identification equations are 
linear in the dc-sircd unknown parameters, and a modified least squires algorithm, 
is used to obtain estimate* of these parameters. Some of the parameters, however t 
are not identifiable due to the restricted motion of proximal links and the lack of 
full force/torque sensing. The: algorithm was implemented on the MTT Serial Link 
Direct Drive Arnir A good match was obtained between joint torques predicted From 
the estimated parameters and the joint torques computed from motor currents, 



1 Tkt5 chapter i* a ic vised, versicji of [An h Atktflan, ittd HaJlrrbaDk, ]9B&n) 
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4,2 Introduction 

This chapter presents a method of estimating all of the incrtial parameters, the 
mass, the center of mass, and the moments of inertia of each rigid body Imk of a 
robot manipulator using joint torque sensing. Determining these parameters from 
measurements or computer models is generally difficult and involves some approxi- 
mations to handle the complex shapes of the arm components. Typically, even the 
manufacturer;; of manipulators do not know accurate Values of these parameters. 

The degree of uncertainly in inertia! parameters is an important factor in judging 
the robustness of modal-baaed control strategics. A common objection to the com- 
puted torque methods, which, involve full dynamics computation (e.g., Luh, Walker, 
and raul, 1980b), is their sensitivity to modelling errors, and a variety of alternative 
robust controllers have been suggested (Samson, 1*53; Slotine, 1G85; Spong, Thorp, 
and KlcinwaJtE; 1984,. Gilbert and Ha + 1&B4]. Typically these robust controllers ex- 
press modelling errors as a differentia] inertia matrix and coriolis and gravity vectors, 
but in bo doing, no rational basis is provided for the source of errors or the bounds on 
errors. The error matrices and vectors combine kinematic and dynamic parameter 
errors, but kinematic calibration is sufficiently developed so that very tittle error can 
be expected in the kinematic parameters (Whitney, Lozinski, and Rourke, 1Q84). 

One aim of this work Is to place similar bounds on inertia! parameter errors by 
explicitly identifying the inertia] parameters of each link that go into the making of 
the inertia matrix and coriolis and gravity vectors. Our work in load identification 
(Atkeson, An, and Hollerbach, 1985a} suggests, for example, that mass can be accu- 
rately identified to within 1%. Therefore, an assumption of 50% error in link mass 
in verifying a robust control formulation [Spong, Thorp, and Klein waks, 1964) is an 
unreasonable basis for argument- Slotlne (1985} suggests that errors of only a few 
percent En inertia! parameters make his robust controller superior to the computed 
torque method,, but it may well be that these parameters can he identified mom 
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accurately than his assumptions. 

As an. alternative approach we propose estimating the inertial parameters on the 
basLE of direct dynamic measurements. The same algoritlnns used to identify load 
inertia! parameters (Atkeson, An, and rTollej-bach, TUSJia) can be modified to find 
iink inertia! parameters of a robot arm made up of rij;id parLs. The Xewton-Euler 
dynamic equation:? are need to express the measured forces and torques at each 
joint in terms of the product of the measured movements of the rigid body links 
and the unknown link inertial parameters. These equations arc iinear in the inertial 
parameters. However, unlike loud estimation, the only sensing IS one component 
of joint torque j inferred from motor current. Coupled with restricted movement 
near the base, it is, therefore h not possible to find alt the inertial parameters of the 
proximal links. As will be seen, these missing parameters have no eEFect on the 
control of the arm. 

In this chapter h manipulators with only revolute joints are discussed since han- 
dling prismatic joints requires only trivial modifications to the algorithm. The pro- 
posed algorithm was verified by implementation on the MIT Serial Link Direct Drive 
Arm. 

4.2.1 Previous Work 

Mayeda, Gsuka, and Kangawa (lSSi) required three sets of special test motions to 
estimate the coefficients of a closed -form Lagrangian dynamics formulation. The 10 
inertial parameters of each link are lumped into these numerous coefficients, which 
arc redundant and susceptible to numerical problems in estimation- On the Other 
hand, every coefficient, is identifiable since these coefficients represent the actual 
degrees of freedom of the rohot. By sensing torque from only one joint at a time, 
their algorithm is more susceptible to noise from transmission of dynamic civets of 
distant links, to the proximat measuring joints. For efficient dynamics computation, 
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the recursive dynamics algorithms require the link parameters explicitly. Though 

recoverable from the Lagrangian coefficients , it is tetter to estimate the inertia! 
parameters directly. Though this algorithm was implemented oft a PUMA robot, 
it is difficult to interpret the results because of dominance of the dynamic* by the 
rotor inertia and friction. 

Rukerjee and Ballard (1QSS) directly applied their Load identification method to 
link identification, by requiring full force-torque sensing at each joint. Instrumenting 
each robot link with full forte-torque sensing seems impractical, and is actually 
unnecessary given joint torque sensing about the rotation axis. Partially as a rerjlt, 
he docs not address the issue of unidentifiability of some menial parameters. Alaa, 
he did not verify his algorithm by simulation or by implementation. 

Olsen and Bekey (1965) presented a link identification procedure using joint 
torque sensing and special test motions with single joints. The unidentifiability 
of certain inertia! parameters was not resolved, and the least squares estimation 
procedure written as a generalized inverse would fail because of linear dependence of 
some of the inertia} parameters. Again,, their procedure was not tested by simulation 
or by actual implementation on a robot aim. 

Xeumanand Khosla (1965) developed a hybrid estimation procedure combining a 
Newton-Euler and a Lagrange-E uler formulation of dynamics. Simulation results for 
a three degrec-of-freedom cylindrical robot were presented, and the unidentifiability 
of certain inertia! components was addressed. For some reason they state link mass 
must he known for a linear estimation procedure,, but such a restriction does not 
Ejdat with our method. Though planning to work with the CMU DDArm II, they 
have not yet presented experimental res-ultE. 
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joint i 
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Figure 4.1: Coordinate origins and location vectors for link identification. 

4,3 Estimation Procedure 

4.3.1 Formulation of Newton-Euler Equations 

In our work in load identification (Atkeson, An, and Holterbach, l9S5a), the Newton- 
Euler equations for a rigid body toad were formulated to be linear in the unknown 
Inert [aJ parameters. Then the simple linear least squares method was used to es- 
timate those parameter*. l\y treating each link of a manipulator as a load, this 
formulation can be extended to the link estimation problem. The differences in the 
equations are that only one component of force or torque is sensed and that the 
forces and torques from distal links are summed and transmitted to the proximal 
joints- 
Consider a manipulator with tt joint* [Figure 4-1), Each link £ haa its own local 
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coordinate system P, fixed la the link with its origin at joint t". The joint force and 
torque due to the movement of its own link can be expressed by simply treating the 
link as a load and applying the previously developed equations for bad identification 
(Atkeson, An, and Hollerbach, 1955a); 

J- 



fit 







fo-S [*<x] + [v,xifw,xj 

n [(k-Pp] x 1 [■"*] + fwiX||# Wi j 
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l V*J 



or more compactly. 



(4.1) 



W[( = A;^ 

wher# w i; - la the wrench (vector of forces and torques) at joint i due to movement of 
link j alone. A,- is the kinematic matrix that describes the motion of [ink * and £,- is 
the vector of unknown link inertial parameter*. All of the quantities are expressed 
in the local joint i coordinate system, The formulation of the above Newton-Euler 
equation! were already presented in the load identification paper (Atfceson f An, and 
Hollarbach N lttfa). 

The total wrench w H at joint i is the sum of the wrenches w^ for all links j distal 

to joint i: 

if 

*V-£w« (4.2] 

f=i 

Each wrench w,- : at joint i is determined by transmitting the distal wrench vtjj across 
intermediate joints, ThEs is a function of the geometry of t lie linkage only. The forces 
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and torqnee at neighboring joints are related by 
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or mare compactly 
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w V+i =T - w *«M+i 



(4,4) 



where 



ULi= the rotation matrix rotating the link i + 1 coordinate system, to the link t 
coordinate system, 

g,-= & vector From the origin oF the link i coordinate system, to the link t + I coor- 
dinate system, and 

T^= a wrench transmission matrix. 

To obtain the forcea and torques at the i th joint due to the nuremenU; of the j** 
link, these matrices can be cascaded: 



w,- | -=T«T M ---TjW^ 

where Uy = TfTj+i * -*TjAi and XJ^ = Aj, A simple matrix expression for 
kinematic chain (in thia case a six joint arm) can be derived from {4,2} and 
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This equation is linear in the unknown parameters, but the left side is composed 
of a full force-torque vector at each joint. Since only the torque about the joint 
axis can usually he measured, each joint wrench muat be projected onto the joint 
rotation axis (typically [0 b b l] in internal coordinates), reducing (4.fi) to 



T=Ktf> 



(4.7) 



where tv = ]0 r 0, P 0, ft, l]-W; Is the joint torque of the i* h link, if> = [£ ( , £ it ^ a ., ^, £ 6 , $ U \ T , 
and K,-y = [0 K 0, 0, 0, Ojl] -U^ when the corresponding entry in (4&) is nonzero. For 
an n-link manipulator, f is a n x 1 Yector, ^ is a lOn x I vector,, and K is a n x lQn 

matrix. 



4.3.2 Estimating the Link Parameters 

Equation fi,7) represents the dynamics of the manipulator for one sample point. 
For extra data, {4.7) 5a augmented asi 
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N = number af data points 



Unfortunately, one cannot apply simple least squares estimate: 
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because K r K is not invErtihle- due to loss of rank from restricted degrees of freedom 
at the proximal links and the lack of full force-torque ifermng. 

There are several ways to resolve this problem. One way to resolve this problem is 
to use the pseudo inverse to get a solution iff to T = K$. But since Kisa potentially 
large nN x low matrix, the pseudo-inverse is computationally ineffkientr Another 
simple method similar to the pseudo inverse is to use ridge regression (Marquardt 



ftnd Snee, 1975). Ridge regression, makes K f K invertible by adding a small number 
to the diagonal elements: 

* = (K T K + dI lflH )- 1 K T ' T (4.9) 

The estimates are nearly optima] if d « A^fK^K), where A™* is the smallest 
non-zero eigenvalue of K T K. Other methods- of solution* fundamentally different 
from the above two, arc presented in the discussion sjecfcjon- 

4,4 Experimental Results 

Link estimation was implemented on the MIT Serial Link Direct Drive Arm (Figure 
4.2) > a three link serial manipulator with no transmission mechanism between I he 
motors and the links. The ideal rigid body dynamics is a good model for this arm, 
uncomplicated by joint friction or backlash typical of other manipulators. Hence the 
fidelity 0-f this manipulator's dynamic model suits estimation well. The coordinate 
system for this arm is defined in Figure 4,3, A aet of inertial parametem is available 
for the arm (Tab(e 4-1 ), determined by modeling with a- CAD /CAM database (Lee,, 
1983). These values can serve as a point of comparison for our method, but they 
may not be accurate because of modeling errors. 

The motors arc rated at 660 Nm peak torque for joint 1 and 230 Nm for joints 
2 acid 3 (Asada and Youeef-Toumi , 1984). Joint l is presently cap-able of an an- 
gular acceleration of 1150 d eg/see*, joints 2 and 3 in excess of 6O0O degfste 2 . In 
comparision, joint 1 of the PUMA COO has a peak acceleration of ISO (leg/sec* and 
joint 4 at the wrist 260 degfsec*. Joint position is measured by a revolver and joint 
velocity by a tachometer. The tachometer output is of high quality and leads to 
Rood acceleration estimates after differentiation. The accuracy of the acceleration 
estimates pins hi^h angular accelerations greatly improves inertia esti [nation. 

The joint torques are computed by measuring the currents of the 3 phase windings 
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Figure 4.2: The MIT Serial Link Direct Drive Arm 
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Figure 4.3: The link coordinate system. 
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Parameters 


Linkl 


Link 2 


Link 3 


m{Kg) 


67.13 


53.01 


19.67 


mt±{Kg -m) 


0,0 


0.0 


0.3108 


FlMy 


2.432 


3.4081 


0.0 


jnc, 


35.&25T 


16.6505 


0.3268 


I„[Kg- m *) 


23.1568 


7.9088 


0,1825 


J„ 


0.0 


0.0 


0,0 


-*™ 


-0,3145 


0.0 


-0.0166 


A. 


£0.4472 


7.6766 


0.4560 


I v . 


-L2&4S 


-L5030 


0.0 


\ 


0.7418 


O.6&07 


0.3900 



Table 4.1: CAB-rnodeled inertia] parameters. 

of each motor (Asada, Yaucef-Toumi, and Liin, 1981). For the three phase brush! 
permanent magnet motors of the direct drive arm, the output torque is related to 
the currents En the windings by: 



r = K T {T a sin S + It sin(* + 120) 4- I c sin{8 + 240)) 



(4.10) 



The torque Constant K T Tor each motor is calibrated statically by measuring the 
force produced by the motor torque at the end of a known lever arm. The force 
is measured using a Barry Wright Company Astek FS&-10A-200 6-axis force/ torque 
sensor, Asada, Yanccf-TaumE, and Lim [1984) have found that for a motor similar 
to the motors of our manipulator,, the torque versus current relationship war non- 
linear, especially for small magnitudes of torques, and also varied as a function of 
the rotor position. However, for the results presented in this paper, the nonlinear 
effects were ignored since substantial portions of the movements En the experiments 
required large magnitudes: of torques. Since the least squares algorithm minimizes 
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the square of the error s torque errors for torques of small magnitude* do not affect 
the estimates very much.. 

For the estimation results presented, 000 data points were sampled while the 
manipulator was executing 3 sets of fifth order polynomial trajectories in joint space. 
The spec ifica Lions of the trajectories were: 

1. (330, 289.1, 230) to (flO> 39.1 1 -10) degree* in L3s, 

2. (330 t 26&1, -3D) to (S0 N 19.1 1 220) degrees in 1.3a, 

3. (SO, 26ft 1 > -30] to (330,. ift.1, 220) degrees in L3a t 

Since K T K in (4.9) is singular, estimates for the 30 unknowns are computed by 
adding a small number (d - 10,0 << Ab^K^R) - 3395.0) to the diagonal elements 
of K T K. 

Typical results;, obtained using the ridge regression method, arc shown in Table 
4.2, Parameters that cannot be identified because of constrained motion near the 
base are denoted by 0,0*. The first nine parameters of the 6rst lint are not identi- 
fiable because this link has only one degree of freedom about its z axis. These nine 
parameters do not matter at all For the movement of the manipulator and thus can 
be arbitrarily set to 0.0, 

Other paramctexa marked by (f) can only be identified in Linear combinations, 
indicated explicitly in Table 4.3, The ridge regression automatically resolves the 
linear combinations in a least squares sense. Tt f.an be seen that the estimated sums 
roughiy match the corresponding sums inferred from the CAD-mode!ed parameters, 
but the sizeable discrepancy indicates that one parameter Set may be cnore accurate 
than the other. 

To verify the accuracy of the estimated and the mndelfid Tiararr^ter!:. the mea- 
sured joint torques arc compared to the torques computed from the above two sets 
of parameters using the measured joint kinematic data. As shown in Figure 4.4 p the 
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Figur* 4 A: The measured, CAD-mode]led, and the ta Limited joint torques 
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fifiti mated torques match the measured torques very closely. The torques computed 
from the CAD/CAM modeled parameters do not match the measured torques as 
closely. This comparison verifies qualitatively that for control purposes the esti- 
mated parameters: are in fact more accurate than the modeled parameters. 

4.5 Discussion 

Good estimates of the link Snertlal parameters were obtained, as determined from 
the match of predicted torques to measured torques. The potential advantage of 
this movement-based estimation procedure for increased accuracy as well as con- 
venience wag demonstrated by the less accurately predicted torques based on the 
CAD-modclcd inertia! parameters. 

There are three groups of inertial parameters: fully identifiable, completely 
unidentifiable, and identifiable In linear combinations.. Membership of a parame- 
ter in a group depends on the manipulator's particular geometry, Some link inertia! 
parameters are unidentifiable because of restricted motion near the base and the 
lack of full force-torque sensing at each joint. For the first link,, rotation is only 
possible about its z axis. Suppusc full force-torque sensing is available at joint 1, It 
can be seen from (-1,1) that /«,, 7« rtl and J^, are unidentifiable because they have 
no effect on joint torque. Since the gravity Vector is parallel to the z axis, c tl is also 
unidentifiable. If it is now supposed that only torque about the z aris ran he sensed, 
then all inertia! parameters for link i become unidentifiable except I ZZi . 

In a multi-link robot a new phenomenon arises. Some parameters can only be 
identified in linear combinations^ because proximal joints must provide the torque 
sensing to identify fully the parameters of each link. Certain parameters from distal 
links are carried down to proximal links until a link appears with a rotation axis ori- 
ented appropriately for completing Lhe identification. In between, these parameters 
appear in linear combinations with other parameters. This partial identifiability and 



the difficulty of analysis become worse as the number of links are increased. 

The ridge regression automatically resolve* the linear :oti:'.-i nations in a [east 
squares sense, which make these inert] al parameter* appear superficiary different: 
from, those derived by CAD modeling. An alternative method is generation and 
examination of the closed-form dynamics, which is a complex procedure for more 
than two degrees of freedom. 

A second alternative is a numerical analysis via singular value decomposition of 
K in (4.8) > yielding (Golub and Van Loan, 1963) 

K = UEV r 

where £ = diag^} and l? and V T are orthogonal matrices. For each column 
of V tliera corresponds a singular value ffj which if not zero indicates that linear 
combination of parameters,, vf^, is identifiable. Since K is a function only of the 
geometry of the arm and the commanded movement , it can be generated exactly 
by simulation rather than by actually moving the real arm and recording data with 
inevitable noise. 

The above two procedures isolate several sets of parameters whose linear com- 
binations within each set are identifiable. Then we can arbitrarily set all but one 
of the parameters of each set to zero and apply the estimation algorithm for the 
reduced set of fully identifiable parameters. As shown in Table 4,2, for our 3 link 
manipulators these two procedures result in grouping lhe 3D irertial parameters into 
the following categories: 

1. fully identifiable: m a e t „ m^c 9Si I mtit I att , J^, /„,, m ie±i1 I tVi> J„ a 

2. completely unidentifiable: mj, nta-e^, mi, mie, 3S miir yil mic Ml , /„,, 1,^, ;„,, 

3. identifiable in Einew combinations: m a , m jCli , / flJI 7 m , m^, /„,, I Wt> I iM2 , 





Parameters 


Link 1 


Link 2 


Links 


m(K S ) 


0.0' 


0.0' 


1.S92Q? 


mc M (K$ ■ hi) 


0,0* 


-0.1591 


0.4676 


mc ¥ 


0.0" 


0.6776? 


0.0313 


Tnc r 


0.0* 


0.0* 


-l.OOSTt 


^[Kg-m 1 ) 


0,0* 


4,15fl2f 


1.52761 


I** 


0.0' 


0,35-94 


-0.025 tt 


I tt 


0.0* 


0,0116 


0.0145 


*** 


0.0* 


5.212St 


1.89671 


0.0' 


-o.eoaot 


-0,0160 


I„ 


9.3359S-t 


-0,8 IMf 


0,3568 



Table 4.2: Estimated inertial parameters. 

The completely unidentifiable parameters can be arbitrary set to zero. For the 
linear combination parameters, the cOcnbina Lions of thfisa parameters that can be 
identified together are shown in Table 4,3, To obtain a particular solution for these 
parvimntfiTS. run:' am tut rfi ;i| r,'ijc :j ,/ ri] . and f LZi l.o zmr>. Tlir-n, [.III- rt ciiiiini n k <\ 
parameters of this category can be treated as fully identifiable parameters along 
with the parameters In the first category. Those columns oF K which correspond 
to the 15 zeroed parameters are taken out, reducing K T K to a 15 x IS full rank 
matrix. Now, simple least equares can be applied to estimate the 15 identifiable 
parameters. For our implementation experiments, the results of this method agreed, 
with the results of ridge- regression presented in the previous section. 

Although Eiot as simple as the ridge regression method, these methods art attrac- 
tive since it allows us to reformulate the dynamics in terms of only Hie identifiable 
parameters. This then can increase the efficiency of the corresponding inverse dy» 
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Linear Combinations 


Estimated 


CAD-Modcled 


m&**ti ■+ ha 


-LOSS 9 


-1.356$ 


i±i x — It/ft 


-0.3691 


-0.2702 


■"*»] ~r -^zrt 


0,7GS2 


0.8(3.12 


J«. + h** + h,t + m^\ 


15.7029 


12,3169 


***•* "■" *«* *vn 


Q.470Q 


0.4147 


"13^ — '"ity* 


-1.6863 


-3.0814 



Table 4,3 1 Parameters in linear combinations (l 2 = 0.45m,] 

namics computation for controller implementations [Holler bach and Sahar> 19S3). 
We axe still investigating these issues with eventual goals of studying the effective- 
ness oF different control algorithms using the estimated dynamic model and analysing 
thetr robustness with, modelling errors, 

4*6 Open Questions 

Statistical characterization of the errors, finding "rich* movements, quantifying de- 
sirable fcunsur properties and the importance of velocity sensiuK are important issues, 
just aa in the rigid body load parameter estimation work, 

The effect of deviations from rigid body dynamics caused by kinematic mod- 
ellirij; errors, link flexibility, and tin modelled joint or actuator dynamics needs to be 
examined. Tor the direct drive arm we have verified experimentally the usefulness 
of the rigid body dynamics model, however. By identifying the inertial parameters 
directly rather than estimating them from a GAD/CAM model or the properties of 
the individual links we inadvertently compensate for some model structural error. 
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Chapter 5 

Optimal Filtering For Parameter 
Estimation 



5.1 Introduction 

Often in parameter estimation there is a substantial amount of noise contaminating 
all of the sources of data. Furthermore ill many estimation situations there are vari- 
ables in the parameter estimation equations that arc not measured directly, but are 
known transformations of other measured data. These factors present problems for 
many parameter Estimation approaches, such as least squares. This chapter presents 
a three step approach to these problems. The first Step make? use of redundancy in 
the sensing to characterize the noise sources. The second step makes use of the noise 
characterisation to design optimal filters for the different data smirr.es. Finally, an 
eigenvalue/eigenvector based estimation procedure is used to produce the parameter 
estimates. 
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5.1.1 Problem Statement 

The typical linear parameter estimation problem is of the following form; 

«jS5i + «j*i + ■ + a„P n = (1,1) 

or eciuivalently in vector form 

with x-i being measurements, from various data sources and the a^ being the un- 
known parameters . Measurements from the data edhtcee are. each contaminated 
with afcatiatically atationaxy lero mean random noise with variance <n. 

The parameter se£ H {a^a^^ ..,&„}?, is actually redundant, as it can bu scaled 
without changing the problem, so typically either the length of the parameter vector 
is constrained or one of the parameter is tet to some arbitrarily chosen value. We 
will require that the length of the parameter vector is 1, ia.„ 

rial 

Many data points consisting at seta of observations {x lt x it .. . K r„} can bE col- 
lected. The goal here is to find a stt of parametEra, {ari,ai, r, ,,«»}, that obey any 
constraints Euch as a fixed parameter vector length, and also in some optimal sense 
provide a heat fit to the data. We will discuss what we mean by optimal In what 
follows. 

5*1*2 The Least Squares Approach 

One approach that is widely used is kcown as least squares (Silvey, 1D75). A crucial 
feature of this approach is that all but one of the data sources are assumed to be 
noise free, such that the problem is restated as follows The constraining problem 
structure is now: 

oil j a -r- ftj^i + •• ■ *f & n -jr n -i = x„. (5.4) 



where a n is arbitrarily set to —1 and 



*n = *. " * 



(5-5) 



in is the true value of t„ and. £ is the measurement error. The variance of the 
contamination noise, o Cl is zero for all i < n. AH of the noise is assumed to come 
from the variable i > which is a xero mean random variable with variance o N . The 
equivalent vector form for Equation (5,4) ii 
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If all of the p data points collected are used to build a data matrix., A 1 and an 
measurement vector, y, the following matrix equation can be written: 



A-ft -f f = y 



with the data matrix A being the px [n — l) matrix 



A = 



x u Jt Jt 

^l 2 Sj a 






V ^1, X if ." X [ft _,J, J 

and the measurement vector y being a vector of length p- 



y = 



■Tina 



The criteria to be minimized by the chosen set of parameters t {«i> Oi h , 
is the length of the error vector^ e s ie. 
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(5.8) 



(5,9) 



h ftn-]}, 



(5,10) 
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The estimate thai mir.imlzea thi* criteria is given by the normal equations 

A=(A T Aj- 1 A r y (Ml) 

This approach works well on many estimation problems, even though in the real 
world no meaauremenU are perfect. For some problems, however, the contaminating 
noise on the different data sources Is substantia], and cannot be ignored. In thia ca.se, 
a different approach is necessary, 

5*1*3 Motivation For A Different Approach: A Typical Es- 
timation Problem 

Recently, we have addressed the problem of estimating the inertial parameter* of a. 
rigid body load being manipulated by a robot equipped with a wrist force/ torque 
sensor (Atkeson, An and Hollerbach, l9S&a n 108Sb). We were able to formulate the 
rigid body dynamics, in the following form: 

m 



' u ' 
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n* 
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. n * . 





p-g; [ux]+ |«x][wx] 



(B.12) 
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hi 

hi 
hi 

In 
lu 
J» 

The derivation af this equation and a definition of the notation lb provided in the 
Appendix. The important point is. that this problem can he formulated as a. linear 
parameter estimation problem, In that known parameters enter non-linearly, but 
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the unknown parameters only enter linearly in the determination of the forces and 
torques on the load. Equation {,5.12} can he more compactly written as 

w = M^ (5,13) 

where w is a wrench vector, a vector or three forces and three torques, M is a 
matrix describing the movement of the load, and ^ is the vector of unknown inertial 
parameters. 

We noted that the unknown parameters appealed El nearly in the problem, and 
therefore we thought that the least squares approach to estimating the parameters 
could be used. In order to apply the least squares approach to this problem we had 
to measure the robot position, velocity, and acceleration, and the forces and torques 
on the wrist sensoT. Furthermore! the position, velocity, and acceleration must 
he measured perfectly^ Unfortunately, this was not a reasonable mo Jet of reality. 
The robot was only Equipped with portion, sensors, and we had to numerically 
differentiate the position data tn find the velocit3 r and acceleration. To do this we 
used a discrete differentiating filter convolved with a low pass filter (Hamming, 1977). 
A typical sample of the data IS shown in Figure S.l. The differentiation proceES 
amplified whatever noise was in the positit;:: reLord, leaving the derived acceleration 
record greatly contaminated with Jioise. The force and torque data was also clearly 
contaminated with noise (Figure S.£]. 

Thus, this problem did not easily fit into the standard least squares framework, 
because all of the data sources ha*e substantial contaminating noiee. 

One possible approach to handling the noise is to filter the data. However, 
whatever filter that is applied must preserve the relationship between the different 
data sources, and we nmert *o know which filter gives ua the best parameter estimates. 
The standard least squares parameter estimation approach gave ns no guidance as 
to how to filter the data to achieve these goals. 

Another approach to this problem is to symbolically integrate the load param- 
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Figure 5 A: Example of measured joint ang^ calculated joint angular velocity, and 
calculated joint angular acceleration. 
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Figure 5.2: Example of measured Forces and torques. 



ctcr estimation equation, Equation [5.12) a so as to avoid d ifferentiating velocity to 

find accelerations. One approach to integrating the equations is presented In the 

appendix to this chapter, and w* tan express the resulting estimation equations in 

matrix form m: 

m 

fne, 
■■-- Jn 

r a 
r» 

h% 
/ 3a 






L( 



Mdi 



(5.H) 



S5 



here the first row of [j/ +r M dr] is 
p +/ wxpdr-f/ Rdr s g 



« / J v* 



[15} 



and the second row is 






(le) 



It Is not dear j however, that this is. the right thing to da. If there is substantial 
low frequency noise or bias in the data the integration will amplify that noise relative 

ta the signal frequencies of the data. What is needed Is a problem torrnulatlan that 
gives us guidance as to how to design data processing filters for estimation and how 
ta handle the need to differentiate or integrate some of the data, to Eupply missing 
measurements, 

E.:L4 Prototype Problem To Be Discussed As Example Of 
Procedure 

The urublum we are: goiir.g to Fojvjs orv i". what : r.\' rwv- '■p. the load JEierciai ;jaraj:nt'.cr 
estimation problem stripped down to its bare essentials. We have a one-dimensional 
system, and its dynamics are given hy r r 



f — nx • a 



[fi.ll] 



which is simply force = maw acceleration. 

Our goal is ta estimate the mass of the object e& accurately as- possible, given a 
fixed amount af data. Unfortunately, we only measure the Force and the velocity 



*G 



of the object. We wish to determine the data processing procedure to that, produces 
the best estimates, in some sense. We have several choices: 



Differentiate velocity to estimate acceleration, and use the equation 

d 

to produce the estimates. 



m=m^-v(t) (&,lg) 



Integrate the fcfet> and use the equation 

rl, 

'if 

to produce the estimates. 



f f f(t)4t = ryi{v(t f ) - v(U)) (5.19) 



* Differentiate velocity to estimate acceleration, bat then apply a fitter of some 

30ft to "clean Up* 1 the data. The estimates are produced from an equation of 
the form 

i*f(t)=m(t*^)*v{t) £5.20) 

where I is som* filter and. + is the convolution operator. 

The goal of this chapter is to show which of these choices is the best choice to 
make, and ahow what factors affect the choice OF data processing procedure. To do 
this,, we will start by explaining how an estimation procedure that can handle noise 
in all the data work5r We will first show how Standard least squares is applied, and 
we want to look carefully at what criteria it optimizes and how it handles colored 
noise. Next we shall examine an eigenvalue/ eigenvector estimation procedure, how 
it applies to noise on all database, and how it handles colored noise. We will explain 
how the colored noise estimator can he partitioned into a white noise estimator 
preceded by appropriate data filters. Then we will ask what data processing produces 
the best estimates from this estimation procedure, and then we will show how to 
generate the information about the data and the noise that allows us to design the 
appropriate data processing procedures, 
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5.2 The Estimation Procedure 

We will now diacuaa application of the estimation procedure to the prototype prob- 
lem, estimating the mass of a one-dimenstona! load. First, we must describe notation 
for al] the quantities we need to keep tract of r f{t), a[t)i and v[i) are measured or 
estimated forces, accelerations, and velocities, /(i), *&(*)„ and v{t) are the underly- 
ing true rallies for these quantities, m is the true mass of the object, and m 3a the 
estimated rnasa. Note that 

/(f) = m « a{t) = m - —v(t) (5,21) 

holds for the true values of the data, but ain.ee 

/(0 - /(*) + M*) (5.22) 

a{t) = 5(*) + n«(f) (5,23) 

v[t) = v(t) + n v (t) (5.34) 

where tif[t), n a (t] t and n^t) are the noise contaminating the data, the corresponding 
equalities for the measured data do not hold: 

j 

f(t)?m-e[t)?m-- v [t) [5.25) 

5*2*1 Application Of Standard Least Squares To Prototype 
Problem 

What is presented in this section is well known (Sifrey, 1975, for example) and ia 
presented so as to make what follows more comprehensible. 

To show how to appiy standard leaat squares to this problem,, wc mutt assume 
we have a perfect acceleration measurement, a(t) = a(t)i but a noise contaminated 
force measurement, J(t) = miff) -»/{*). We wi]] mode] the noise t n/(l), as zero 



mean, gausslan random noise. n f (t] is not necessarily "white 11 noise, however, but 
can have any power spectrum, denoted by S H MJ). 

Wc can estimate m. using standard least squares in the following way. In the 
white noise case, £y(**0 ~ *i ^e appropriate criteria to minimize lb 






where 



z(t)=m.<i(t}-f(t) 

We express the data in matrix/vector form A^ = f + v. 



{i.te) 



(5.27) 



a{2) 



i. a (p) > 



(-)- 





+ 


<2) 


L /W j 




t f (*J J 



{5.23) 



and the solution Ls given by the normal equations 



i=(A T A)- I A r f 



(5.29) 



The; standard least squares approach minimizes errors aEong only one coordinate 
direction (Figure 5.3A). This should be contrasted with the eigenvalue/ eigenvector 
approach, 

Effect Of Colored Noise 

The effect of colored noise (the noise spectrum S n{ is not constant) Is to lead us 
to weight the errors in our criteria for an optimal estimate. To show this we must 
analyse how our parameter eEtimatlon Schemes work in the frequency domain. 
Since 

e(t) = m ■ fl (() - /(*} = n/ (!), (5,30) 
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Figure 5.3: A 




Figure 5,3: R 
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the pOWCr spectrum of the noise, S g , is simply the power Spectrum of the noise 
contaminating the force measurements, S nf . In the whit* noise case (the noise 
spectrum S ri is. constant) we wanted to miniiniiee the criteria 

E' a M = £[««■»£*) -/(i)) 1 (s.sij 

and by FaratvaL's Theorem (Oppenheim and WE3sky T IG83) 

r*'(o = e i^c«)i a (5.32) 

where E{o>) is the Fourier transform of f{t). Since the expected spectrum of the noise 
was Flat, it was reasonable to weight the error components at different freqjencies 
equally. 

In the colored noise case weighting the errors equally at different frequencies no 
longer makes sense. Since the power spectrum of the noise Is S„ f {w) and is not 
constant, we should weight the error components at different Frequencies according 
to the expected amount of error at that frequency. Our optimal estimation criteria 
to minimize becomes [iu the frequency domain): 

This is equivalent to prefittering the data with the frequency domain filter 



or convolving the data with the inverse Fourier transform of this filter. 

5.2.2 Handling Noise On AH Measurements 

The eigenvalue/eigenvector parameter estimation procedure presented in this section 
was originally developed by [Kaopmaiw, 1937),, and (Levin s 1964) made use of it in 
pulse transfer function estimation, Man? others have extended it in various ways: 
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(Smith, 1368, BonLvento and Guidorsi, 19Tl p Grosjean and Foulard, 191$, Kotta, 
1979, Tian-Zing, et- al., 19S2, Kotta, 19&2) 

Now let us assume there is noise on both the foice, n/, and the acceleration n it 
measurement: 

a(r] =a(t}-n„(t) (5.36) 

Both noise sources are sero mean, gaussian, and with power spectra S Af and S^ 
respectively. The noise sources, are independent of each other, also. Initially, let us 
assume that variance of n a and the variance of »/ are equal, and the noise source* 
are both, whlti, 

Nnw we need la reexamine what the errar is that we are minimizing, We see in 
Figure B.3B that the error we now want to minimize ii in the direction of the vector 



and for a data point (a(t) ,J{t)) T the error is given by 



(U.3T) 



■- . -'"-/'"..,. M 



(--)(:,)) 



This is the component of the "tnie* error perpendicular to the relationship f[t) = 
tfi ■ <t{t) . 

We would like the estimate m to minimize 

t i m -f l 

This ia a nonlinear minimization problem since the desired estimate, m appears in 
both the numerator and the denominator of the objective function, and is squared 
In the denominator. Thus h we cannot apply standard least squares techniques to 
this problem, or use the normal equations to find the estimate. 
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An Eij5«nvaluc/ESeeJiv(ictoT Based Estimation Procedure 

We can solve this nonlinear estimation problem if we think of it as a linear algebra 
problem (Koopmana, 193?) . We expresa tf t) as a vector: 



l °(p) /(P) j 




(A* + 1)1/5 



.(2) 

■ 



[SAG] 



and notice that the above forms the matrix/vector equation: 



Bv = c 



(5,41) 



Note that v has unit length. 

Our estimation problem ia now to find the unit length vector v that minimises 
the length of By. This wilt minimize 



E*'M 



{5.42} 



at we deaire. Now that WC have reformulated thl estimation problem as a linear 

algebra problem, we can recognize that the desired y is simply the right eigenvector 
of B corresponding to the smallest Eingular value of B 1 or equivalent^ V is; iLe right 
eigenvector corresponding to the smallest eigenvalue of B r B [Golub and Van Loan, 
1983). To see this, we realize that minimizing the square of the errors is equivalent 

'.o Tn\T\\r\r.7.\::x 

2<'W=* T D" , B* [5.43) 

W* merely decompose a candidat* if into Its projection along the orthogonal eigen- 
vectors of B r D , i.e., 

v - Tfi Vi + 7jVj + - ■ + "j p v f (S.44) 
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where v t Is the eigenvector corresponding to the smalieat eigenvalue A T , Since v Is 
of unit length, DL] T? = !■ The function to minimize ia now 

v T B r Bv = A*t? +■ AJtJ + - - -f AJtJ (*,45) 

Td minimize this Bum we should choose v = Vi, m previously stated- m can be 
easily computed from * f although it may he hard to find the error distribution of rH 
given the error distribution in v, 

Effect Of Colored Noise 

The effect of colored noise is ante again to lead us to weight the errors In our criteria 
for an optimal estimate. We once, again look at the expected power spectrum of the 
noise: 

EW^-E 1 "-^;^""' (*•«) 

We note that the expected power spectrum of the noise is 

and once again we should weight the errors. Since the denominator in the above 
expression for the power spectrum is a, constant, it can be ignored in the weighting. 
The estimate rrt should minimize 

m a + I 
This 13 equivalent to prefilt Bring the data with the frequency domain filter 

1 (5.49) 

v /m2£ n . + S ni 

We note that our data filter we use to estimate ih depends Dn the true value of 
the unknown parameter, m. We therefore arc forced either to adopt an iterative es- 
timation procedure, or to simply approximate our filters to the theoretically optimal 
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filters. We chow the second course,, as it is not clear we will know the noise spectra 
exactly enough la know the true data filter? to the accuracy where approximating 
m would make a difference. 

Handling Missing Measurements 

When there are data that appear in the parameter estimation equations but are 
not measured directly we must generate that raiaaing data. This requires that we 
measure some other data that is directly related to the missing, data by some known 
transformation. We apply that known transformation to the measured data to es- 
timate the missing data,, and we also need to transform the noise power spectrum 
of the measured data to generate the noise power spectrum of the missing data, In 
this example wc reconstruct acceleration from velocity mea-iurementSr 

Conclusions On Our Estimation Procedure 

1. When there is substantial noise on more than one source of data It is appro- 
priate to use an eigenvalue/ eigenvector based parameter estimation algorithm, 

2. The optimal filter (in the frequency domain) for the prototype problem is given 
by 



3. To design the optimal filter we need to know noifve characteristics such as the 
power spectra 5^ and S„ t . 

4. The true Optimal filter depends on the true values of the parameters Co be 
estimated. We are forced either to iterate or to approximate, and we choose 
approximation in this chapter, 

5. This estimation approach can easily be generalized to apply to more complex 
problems with more titan two [Measurements. 
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6. Missing data and its iwom spectrum ahouiid be reconstructed from other inea- 
srand data if possible. 

5*3 A Filter Design Example 

Figure 5,4 snows an example parameter estimation problem and the resuming optimal 
filters. 

We present a simple example to clarify the concepts presented in the previous sec- 
tions. Force and velocity are measured as in the prototypical problem^ and t;-.e noise 
on each is indep anient zero mean white gausaian noise with variance 1. Therefore 
their power spectra are simply constants. We note that differentiating the velocity 
to estimate the acceleration will result in an acceleration noise spectrum equal to 
«A The true value of the maas parameter 5a assumed to be 1 in this example-, ea the 
frequency domain data filter is 

11 1 x 

(5.51) 



The frequency domain force filter is 

1 
; a ;.— (5.52) 

while the frequency domain velocity filter combines a differentiating filter with the 
data filter. The magnitude of the velocity filter is 

and the phaaa of the filter is a 90* phase advance at all frequencies, due to the 
differentiator. 

Lat us examine what happens to the filters as the frequency, w_ goes to iero and 
infinity, As the frequency goes to 0, the force filter becomes 
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and the velocity filter becomes 



lim -tt^ -• w (5.55) 

in the frequency domain. What 1a happening is at low frequencies the optimal filters 
essentially just differentiate velocity, and do nothing to the force signal. As the 
Frequency goes to oo t the farce filter becomes 



and the velocity filter becomes 



llm I -, I (5,56] 



lim , ^ -* 1 (5.57) 

l^-HW Vc^ + l 



in the frequency domain. What is happening here ia at high frequencies the optimal 
Altera essentially do nothing to the velocity signal, and integrate the force signal. 

We see from the above limits and from the magnitude plots of the optimal fil- 
ters in Figure £.4 that the optimal fitters, avoid differentiating the velocity at high 
frequencies, where noise would be greatly amplirled h but they also avoid integrating 
the force it Jaw frequencies, where hi as and drift would be greatly amplified (Figure 
5.5). 

The details of the filters depend an the noise Epectra and the a prior! estimates. 
of the true values of the unknown parameters, but the filters make intuitive sense 
in what they are trying to do. We note that getting the acceleration noise to aero 
tells US that the data filter should be just as the standard least squares approach 
predicts, 

5.4 Characterizing The Noise 

We see that in order to design optimal filters far the data we need to know the power 
spectra of the noise. Given that in real life one never knows the true value of the 
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frequency 



high frequency 
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Figure 5.5; Example Filters ComparEd. 



signal, but can only look at a noisy meuurement, it seems, unrealistic to expect to 
be able to find the noise power spectra. Howe vet, with no more ser.si.nR than Lhe 
sensing we havs already pusLulated> we can estimate the noiae spectra of the data. 
We simply make use of the redundancy in GHe sensing. 
Let us assume we measure force and velocity 

/M -/<*) + ■/ M (5 5s) 

v{t) = v[t) + Tt v (t) [&J&9) 

We will let the true value of the mass of the object be 1 for this discussion, so that 
/(0 — di^CO* although it is quite easy ta derive the same results far the general 
case. We assume that nf[t) and n*(t) are aero mean, stationary, and independent. 
Let us examine what the power spectra oF the observed data will be. First we 
note that the transfer function from velocity to force Is 

The power spectrum of the true force signal will be related to the power spectra of 
the true velocity signal by 

SjW\ = H[jw)H*{Ju)SM Ml) 

and the true cross spectra will be 

fy(«) = H(MS^) - -^Jy (5.62) 

The spectra and cross spectra of the actual data can now be easily found: 

S t (w)^S f [u)+S^(u) (5.63) 

S r (w)=S 5 (w) + aU W ) (5,64) 

tf„H - HU^S-^) = ^j^ (5.65) 
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This provides us with enough information to estimate the noise spectra: 

| n ,(w) = S f {w) - $vf[v)H* (jw) (5.66) 

^H-5,M-|M (5.67) 

Wc have implemented this approach to noise characterisation on simulated data 
and noise, and it seems to perform well. We have yet to test it on real data or actual 
noise. 

Conclusions On Noise Characterization 

1. W* can use "redundant" sensing to characterize noise spectra. 

'L. Wc need :<j know \\w; :rue \\-.A\:v.-j'\.\\& relationship het.wfifin the .'f-i: 1 1 n ii a n t s'g 
nale. In this case, we needed to know; 

* The true mass m. 

* I[t) = ma{t) 

* *(0 = i*W 

3. ThU approach to noise characterization generalizes to many other redundant 
Reusing situations. 

r>Ji Discussion 

One may ask whether the additional complexity in data processing suggested by this 
chapter is worth the effort. We arc in the process of applying this scheme to a Teal 
parameter estimation problem with data from actual hardware. This will enable US 
to more COncreteJy assess the procedure's benefits. 

We should point out t however> that one mates data filtering decisions all the 
time. If we numerically differentiate data we also low pass filter it H and must choose 
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the characteristics, of the filter. If we integrate data, we must choose integral ion 
intervals, which set the Jilt Bring characteristics of the integrator. We should at least 
be aware of what good filtering decision* look like, even though we may r.ot apply 
the optimal filters directly, 

We can ask whether the filtering that is suggested here makes intuitive sense. 
Clearly, the noise matters in designing a filter,. Tf there is almost no noise in a certain 
frequency range we should pay attention to those frequencies, while if the noise Is 
much Larger than any signal at some frequencies we thould probably ignore those 
frequencies. This is exactly what the fiker tells us tD do. Also, the decision of when 
to Integrate versus when to differentiate data seems reasonable, as explained in the 
filter design example section. 

We ace solving a nonlinear estimation problem, and one would have expected an 
Iterative yro-ce^ somewhere. Note that the computation of eigenvalues a:id eigen- 
vectors, or singular values and singular vectors require an iterative algorithm, and 
this is where the iteration occurs. Such eigenvahie/singular value computation pro- 
cedures are extremely well developed and a lot of effort has been put in to making 
them numerically stable [Golub and Van Loan, 1953). 

5,6 Conclusions 

We have developed a parameter estimation procedure that shows us appropriate 
ways to filter data for parameter estimation. The procedure addresses two main 
problems: 

1. Noise is typically present in all data. 

2. Often we must reconstruct missing measurements. 
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r>.7 Appendix 

5.7.1 Deriving The Dynamics Equations 

The equations for the forces and torques (expressed in the load coordinate system) 

necessary to move a load along a given trajectory are 



f = mp - mg 4- w x mc + tit v. [vx mcj 
b = Iw + wx [lur] + mc X p - mc x g 



(5.6S) 



where: 



f = the net forte on the load, 
m = the mass of the Joad, 
p — the acceleration of the load, 
g = the gravity vector (g = [0,0, -$,& metere/sec'j), 
w = the angular velocity vector, 
W = the angular acceleration vector, 
c = the unknown location of the center of mass of the load, 

relative to the force sensing coordinate system origin, 
n = the net torque on the toad, and 
I = the moment of inertia tensor of the load about the 
force sensing coordinate system Origin. 
In order to formulate the above equations as a system of linear equatione, the 
following notation Is UJSed; 



[A3 X C = 







\J. 



CI -W; 



-w r w t 



D 



[** 



A I 1 



(5,70) 
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{5.71) 



where 

i* f /„, '*■ t 5 " 72 ) 

y j J 

-'a* -'jjj i ii 

Using these expressions, Equations (5 .£9) and (5.69) can be written as a single matrix 
equation expressed in load coordinate frame. This load dynamics formulation is 
presented in Equation (5.12). 



5.T.2 Integrating The Forces And Torquea In The Force Sen- 
sor Frame 

Throughout this- derivation we will use the superscript notation * and p to indicate 
what coordinate frame a vector is- expressed in, if there is any confusion. If there Lb 
no such superscript, the vector is expressed in the load frame. 

The term p lb integrated using equation 7.22 of (Symon f 1971); 



R 1 CD P )= ^ R '^ +R[DWX ^ 



[5.73] 



were H is the rotation matrix representing the rotation Iran; an inert! ai coordinate 
system to the force sensing coordinate system that continuously moves with the ioad. 
To get an integral form of thia equation we simply integrate it: 

H^T t+T tt+T 



it t -f* 



(5.74) 



1C4 



and performing the indicated rotations leaves us with 

rt+T t+T fl+T 



ft+T t+T fl+T 

J *p dr = F p + / P U x Pp dr 
J t t h 



Simitarty 



ft+T iH-r r t T 

Jt it J* 



p t*f dr = "u 






since w x w = 0. We also remember thai 



fl+T ( fl+T \ 



(5.75) 



(5.76) 



[&.77J 



if+T" 1 



Each of the matrices [ F tLix] and [tu| tan be integrated element by element to 
show that 

jf **P<frx] rff = (/ f+r 'w *) x] - [ W"] X [S 78) 

The matrix (g - pjx] can be integrated element by Element in the Same way. Each 
matrix element of the terms [•"wxlp^x; and [*«*:][#'«] are numerically integrated 
by adding values at each time step, The result of this integration is given in Equa- 
tion (5-14). 



QJ 



(5.79) 
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Chapter 6 



Single Trajectory Learning 



3*1 Abstract 

We present an algorithm enabling a robot that Is repeating the tame trajectory to 
reduce its trajectory error on repetition of the motion. The algorithm, details how 
to use the trajectory following error {the time history of the differencea between the 
desired trajectory and the actually executed trajectory) after any particular move- 
ment to update an actuator feedforward command that is used for the subsequent 
attempt. This approach to robot learning ia based on explicit modeling of the robots 
and. uses an inverse of the robot model as the learning operator which processes 
the trajectory errors. Results are presented from a successful Implementation of this 
procedure on the MIT Serial Link Direct Drive Arm. TJie major point of this, chapter 
is that more accurate robot models improve trajectory learning performance, and 
learning algorithms do not reduce the need for good models in robot control. 

1 Ttia chapter u a. r$v|M vinfan of (Ackticn ind Mclutyt^ 
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6*2 Introduction 

Adaptive feedforward control fur repetitive motions: In actual use robots 
Lend to execute the same sequence of motions with the same loads repeatedly. We 
can Lake advantage of this pattern of usa};e by specializing the robot control system 
to store feedforward commands in a memory and play them back when necessary. 
This type of control system would repeat its errors on each movement, in contrast 
to the performance improvement with practice seen in human movement control. 
We propose an algorithm that ^ses practice to Improve movement execution, by 
altering the stored feedforward commands on the basis of previous movement errors. 
This serves two purposes: 1) to improve robot trajectory following for repetitive 
movements, and 2) to increase our understanding of the rale- of practice in human 
motor control, 

6.2.1 Previous Work 

Recent work in a number of laboratories has focused on how to refine feedforward 
commands for repetitive movements on the basis of previous movement errors. The 
first paper on repeated trajectory learning seems to have been (Uchlyama, 1973). 
Subsequent work includes (Arinioto, 1985; Arlmoto et, a)., 1954a, 1984b f 1984c , 
1985:, Hara, Omata, and Nafcano, 1985; Kawamura et, a!,, lftS-4, 19&!>; Casalino 
and Gambardella, 398(3; Craig, 1984, 19S3; Furuta and Yarnakita., 19&6; Harokopus, 
1986a, 1986b; Mita and Kato, 1985 [ Morita, 19S6; Togai and Yamano, lOSS, ISflo; 
Wang, 1984; Wang and Horowitz, 1985) 

These papers dlSCUSS using linear learning operatora and have emphasiz-cd stabil- 
ity of the proposed algorithms. There has been little work emphasising performance,. 
Le, the convergence rate of the algorithm. Simulations of several of these algorithms 
have revealed very slow convergence and large sensitivity to disturbances and sensor 
and actuator noise, 
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6,2.2 Features Of This Work 

What distinguishes this trajectory learning algorithm from previous robot trajectory 
learning scheme* is the following tomb [nation: 

Provides guidance for designing the learning operator; The central prob- 
lem in making use of trajectory error measurements- is transforming those errors into 
feedforward command corrections. Previous work has explored the requirements on 
the learning operator Tor convergence, but has given Little guidance as to bow one 
should choose a particular learning operator from the large st: 03" burning, operators 
that converge, The algorithm proposed here explicitly uses an inverse plant mode! 
as the learning operator, If there are no disturbances or sensor or actuator noise 
and we have perfect model* of the robot,, the algorithm will correct any errors in the 
feedforward command after one practice motion. 

Handles nonlinear plant*! The algorithm can make use of the nonlinear rigid 
body robot dynamics in the model used to generate feedforward command correc- 
tions Previous work have used linearized or otherwise simplified robot models. 

Make* explicit use of feedback control: The algorithm takes explicit ad- 
vantage of the on-line trajectory improvement provided by the feedback controller 
in calculating the next feedforward command. 

Successful Implementation: The algorithm has been implemented on an ac- 
tual robot, the MIT Serial Link Direct Drive Arm, Excellent trajectory learning 
performance requiring onEy a small number of practice trials has been achieved. 

Generality; works with many feedback controllers: The algorithm ap- 
plies to a wide variety of feedback controller structures, as only knowledge of the 
feedback controller output is required, Tt would be easy to combine this adaptive 
feedforward control algorithm with adaptive feedback controllers. 

Generality: works with many plants-: The algorithm does not require the 
plant dynamics to be of a particular type and applies to a wide range of robots 
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with (for example): actuator dynamics, joint compliance dynamics, and flexible link 
dynamicSr This is an important feature of the algorithm, as it Is bee timing clear that 
a typical robot exhibits quite complex dynamics (Sweet and Good, 1985}. 

Analysis relevant to other schemes: The analysis of this algorithm makes 
clear why other trajectory improvement schemes that intuitively seem correct often 
perform badly or actually degrade performance in practice. 

6.3 The Trajectory Learning Algorithm 

6.3.1 The Control Problem 

Our goal is to repetitively execute an unrestrained robot trajectory with as small 
ft trajectory following error as possible, The desired trajectory has a finite time 
duration and the robot starts in a known initial steady state, x(0) . We assume 
that modelling errors are much larger than any sensor and actuator noise or plant 
disturbances, and save the question of how to appropriately filter out non-repeatablc 
noise and disturbances for a later paper. 

There are three components of the trajectory learning algorithm: feedforward 
command initialization, movement execution, and feedforward command modifica- 
tion (Figure 6.1). After initializing the feedforward command, the movement exe- 
cution and feedforward command modification steps are executed for each repeated 
movement attempt. 

6.3.2 Feedforward Command Initialization 

Giv*n the desired ttaLc trajectory specification, 5tj(i), the feedforward command 
memory is initialized using a model of the inverse of the robot dynamics (Fig- 
ure 6.1A): 

u* t (t)= Arctium C&« 
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Figure 6,1; Block diagrams for feedforward command initialization, execution, and 
modification. 
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Using Inverse mortals of robots to seneratfi feedforward control signals: 
The robot itself transforms a vector of actuator eoflicTiaiLds, u, into motion t x 

x = fl(x,u] (6.2) 

X represents the state vector of the robot, which includes the position and velocity 
of each joint, x is the derivative of x. A model of the inverse of the robot dynamics 
transforms a trajectory specification, x((J, into actuator commands necessary to 
achieve the desired motion: 

u= J R" 1 (x,x) (6.3) 

For robot arms a rigid body dynamics model is often used to predict the forces and 
torques necessary to achieve a particular motion, and thus serves as a model of the 
inverted robot dynamics. The rigid body dynamics equations for a robot can be 
written as 

£ _1 (x , x) = Tbt^ues = Iff) $ + $ C[t) 9 + g(*) (6, 4} 

where 8(l) is the desired trajectory of the joint angles, 1(0) is the inertia matrix of the 
arm, G($] is the GoriolLs and centripetal force tensor, and g{#) is the gravitational 
force vector (Hollerbach, lftS4). For some robots it is argued that additional sources 
of dynamics are important (Gooi\, l$85a; Sweet and Good, 10S5; Good, Sweet, and 
Stroebeij 19&5). In these cases we can still model the robot dynamics and Invert the 
model. For the purposes of this chapter we will use only rigid body robot models, 
a? these types of inodelE describe most of the dynamics seen in a direct drive robot 
kirj[L. 

Generating the robot model: In order to use a rigid body robot model the iner- 
tial parameters oF the robot must be known, There are several sources of pararoc-Lcr 
estimates: 1} CAD/CAM models may be used to compute inertial parameter esti- 
mates, 2) the robot can be disassembled and the inertial parameters of the parts 
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measured, or 3} the inertial parameters can be estimated using movement and actua- 
tor data frdrfl actual use of the robot. In previous work we have taken the parameter 
estimation approach and shown that the dynamics are linear in the unknown iner- 
tia! parameters of the links, We have identified the rigid body dynamics of the MIT 
Serial Link Direct Drive Robot Arm, and have used this dynamic model far control 
(An, Atkeson, and Hollerbach, 1985), 

6.3.3 Movement Execution: 

The t'th attempt at the desired movement is executed using the current feedforward 
command, *!%(*) [Figure 6.1B}, This command includes an error, Sv.\ f (t) t which is 
reduced by the actions of a suitably designed feedback controller^ O t leaving an error, 
^u'(f), in the actuator commands aent to the robot, u'[£). For a linear plant and 
controller the amount of error reduction can be expressed using Laplace transforms; 

The actuator commands and ths actually executed movement trajectory, x'(tj, are 
stored for use by the learning module. 

6.3.4 Feedforward Command Modification 

The previous steps in this algorithm are standard components of many current robot 
controllers. One contribution of this work is to propose a particular approach to 
transforming irajectary errors into modifications of the feedforward command. We 
represent errors in our dynamic, models of the robot as an input command distur- 
bance, which we can correct for by modifying the feedforward command. We use 
our models of the Inverted robot dynamics to estimate this actuator command error, 
Sv. (t) t which is then subtracted from the previously applied actuator commands to 
form the feedforward command for the next movement attempt (Figure G.lC). 
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EatiinatirLi! the actuator command error 

We assume there exHts a. correct actuator command history, n*(i), that drives the 
plant exactly along the desired trajectory in the absence of disturbances or noise, 
and define the current actuator command error as 

where R _1 () ig the true inverse- rohvt dynamics* Oar estimate of the error in the cur- 
runt actuator command is found by simply replacing the true invef.se robot dynamica 
with our model: 

When the plant is linear we can apply the plant inverse directly to the trajectory 
error, 

fi _1 (x h i) - A" 1 [x j5 Xj) = £-'(*, e) (6.S) 

The command error est] mate can be expressed in terms of ihe feedforward command 
error using Laplace transibruis, 

Updating the feedforward command 

The update for the feedforward command on the next movement is simply the mod- 
ified actuator command. 
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By aubtr acting the correct command,, u", from both tidea of Equation (6.11) and 
using Equation* (6.5) and [6,10) we cats express the Laplace transformed error prop- 
agation equation. Tor the linear case: 

With a perfect mode], i£ = i£> the feedforward command errors wilt be zero after 
one movement. 

6*4 An Implementation Of The Algorithm 

We have implemented the trajectory learning algorithm on the MIT Serial Link 
Direct Drive Arm. This nhres joint arm is described in (An, Atkeson, andHollErbach,, 
1985). To explore the effectiveness of our learning algorithm we will present results 
on learr.ir.g a particular trajectory, 

Discrete vs. Continuous Time: Since we are using a computer to generate 
the control signals, all signals and controlled systems were represented in discrete 
time rather than continuous Lime. 

The Te*t Trajectory: All three joints of the Direct Drive Aran, were commanded 
to Fallow a fifth order polynomial trajectory with zero initiaE and final velocities and 
accelerations and a 1.5 second duration. Figure 6.2 shows the shape of the trajectory 
for each jaint, and Table 6.1 gives the initial and final jotnt positions h the peak joint 
velocities, and the peak joint accelerations, 

The Feedback Controller: An independent digital feedback controller was 
implernefitEd far each joint and was not modified during learning. 

Initialization Of The Feedforward Command: The initial feedforward 
torque* were generated f rain a rigid body dynamics mrcrffi], The model and the 
estimation of its parameters are described in (An a Atkeson,. and Ho]] Broach, 19&5). 
The calculated feedforward torques arc shown in Figure 6.3A. 
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Table G-L Test trajectory parameters. 
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Initial Trajectory Performance: As an index of trajectory following perfor- 
mance the velocity errors (the difference between the actual joint verity and the 
desired joint velocity) for the first movement are shown in Figure 6.4A.. Wg have plot- 
ted the raw velocity error data to give an idea of the relative size of the trajectory 
errors and scrusor noise. 

Calculating Acceleration and Filtering! In order to use the rigid body 
inverse dynamics model to compute joint torques it wag necessary to compute the 
joint accelerations. Joint positions and velocities were measured directly. A digital 
differentiating filter combined with an 8Hl, low pass TLter WSJ applied to the velocity 
data to estimate accelerations. 

To reject noise and non-repeatable disturbances it 3fl necessary to filter the trajec- 
tory errors and controller output to improve the convergence of the learning process. 
in this implementation we applied low pass digital filters with an 8Hz* cutoff to the 
data used in the learning process. We filtered the references used by the Learning 
operator with, the same filter used on the data. It was necessary to correct for incon- 
sistencies between the velocity sensors and the position measurement^ which was 
done by calibrating the position reference to the feedback controller* 

Final Trajectory Fsrformauce: The robot executed two additional training 
movements which are not shown, and its performance on the fourth attempt of the 
test trajectory was assessed. Figure 6.3B shows the modified feedforward commands 
used on the fourth movement, and should be compared with the predicted torques 
shown in Figure 6.3A. Figure: 6.413 shows the velocity errors foj the fourth movemen'., 
and should be compared with the initial movement velocity errors in Figure 6.-4A- 
There has been a substantial reduction in trajectory following error after Only thre* 
practice movements. 
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6.5 Using Simplified Models 

It may seem unnecessary to nse the full rigid body dynamic.* model of the robot in the 
learning algorithm. One might think that learning algorithms of this type allow one 
to avoid modeling the robot in fall detail, This is not the case. Simplifying the robot 
model necessarily introduces additional modeling error. Without careful analysis 
such modeling errors may cause the learning algorithm to have poor performance, 
or even to degrade performance. As an illustration of the possible effects of modeling 
error due to the use of simplified models,, we will now present a seemingly reasonable 
simplified model of a two joint robot arm that when used as the learning operator 
fails to learn. The robot arm is a planar two link mechanism with rotary joints and 
J9 described in the introduction to (Brady, ct aL, 19S2.) 

The simplified dynamic model that we have chosen for this example is that of 
two independent rotary joints with constant moments of inertia. That is, we ignore 
the centripetal and coriolis torques of Lhc complete r:p;id body robot dynamics, and 
we assume constant moments of inertia around each joint. The moment of inertia 
of link 2 is a constant with respect to ff a . The moment of inertia around joint 1 
depends on #j, We approximate the moment around joint 1 as the average of the 
maximum and minimum moments around that joint, over all possible configurations 
of the robot. The equations of motion for such a simplified system are: 

Torqtua = 1 ■ I (&13) 

where I is a constant diagonal 2x2 matrix. This gives a learning Operation of: 

uj^ 1 = u 1 ' - I„u mi ied " $ - h) { 6-1 4 } 

The results of applying this learning algorithm to the simulated two joint robot 
movement axe show in Figure 6,5, The movement is from point a to point b with zero 
initial and final velocity, acceleration, and jerk (seventh order polynomial) r Feed- 
back control is provided by independent PD controllers at each joint, each having 
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a bandwidth of 1.0 hertz and damping coefficient of 0,70? (based on Ida™^}. 
Figure 6.5 A shows the perform* ncp of the system on the initial trial, with the ini- 
tial feedforward torque^ U^ n baaed on the simplified dynamics model. Figure 6.5H 
shows the performance of the system after five Iterations of the Learning algorithm. 
In. this case, where an inaccurate inverse model of the robot has been used to update 
the feedforward torque command s no improvement in trajectory fo] lowing perfor- 
mance is seen- Had the fu[] i inverse dynamics model been utilized (under these ideal 
conditions of no measurement noise,, actuator noise, or external torque disturbances) , 
our algorithm would have produced ft perfect movement after one iteration. 

Appropriate Robot Models ; It has been argued that simplified models are 
appropriate for a robot with high gear ratios such as the PUMA, One must still mode! 
the other sources of dynamics prominent in these types of robots (Sweet and Good, 
19S5; Good, Sweet, and Stroebel, 1QS5), Higher order actuator dynamics may ptay 
ftn important role (see h for examplCj Goor [IQSSa^)). Our point is not that the rigid 
body dynamic & are the oniy appropriate model and must be used, but that we must 
be careful to include all significant dynamics in our models. Learning performance 
can be used to assess the quality of the models used to drive the learning. 

6*6 Convergence 

An important Question that arises is how close our model of the inverted robot dy- 
namics has to he to the true inverted robot dynamics for the proposed trajectory 
learning algorithm to converge to zero trajectory error. We will indicate how conver- 
gence can be tested in the .general nonlinear case h but by specializing the convergence 
criteria to the linear case and presenting a numerical example we will show that a 
convergence proof does not guarantee acceptable performance. 
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0.6.1 Nonlinear Convergence Criteria 

Analogies to solving simultaneous nonlinear equations: One way to view 

tha proposed trajectory learning algorithm is as a, procedure to solve the vector 
nonlinear equations: 

for fiff t where XJ " iS the sampled version of the desired trajectory expressed as a vector 
Cxj{l]i5tdl2] h . . . ,Xdj2 ^ ]) T, , i*// is a similar vector of the feedforward commands,, 3" is 
the number of sample* in the trajectory, and Jc Is- a nonlinear function representing 
the true robot and feedback controller dynamics. Since the initial stale of the robot 
at th* beginning of each movement is a known constant, R{) does not require tbe 
robot initial state as an explicit argument, 

Testing for convergence uajn^ lixud puint t hflory; One approach to analyzing 

convergence of algorithms to solve nonlinear equations it to combine the original 
equations and the proposed algorithm into a single iteration function, Each cycle 
of movement repetition and Feedforward command modification can be expressed as 
an iterative function, 

tin = Mfl f6 ' 16 ) 

and we can appeal to fixed point theory for convergence condition* for pjj [Isaacson 

and Keller, 1966}. This is the approach suggested by (Wang, 1984; Wang and 
Horowitz, 1985), 

6.6*2 Convergence Does Not Guarantee Good Performance, 

Although conceptually elegant, the convergence conditions provided by fixed point 
theory are too weak to be useful, Theieason for this is- that the duration of the exe- 
cuted trajectory is finite and the controlled plant is causal. The trajectory duration 
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is referred to M the teaming interval, as ihia is the lime over which liie feedfor- 
ward command is modified. Convergence can be guaranteed by assuring tbat the 
feedforward command errors propagate Forward in time osi ouch cycle of movement 
execution and feedforward command modification until the command errors propa- 
gate out of the learning interval (Figure 6.6). Command! and trajectory errors can 
grow exponentially as long as they are continually shifted forward in time, as we will 
show in the following example. 

Th« example plant 

Consider a rotary single degree of freedom robot. The dynamics of this robot in 
continuous lime are 

T = IS (6,17) 

where r is the torque at the joint, / is the moment of Inertia of the robot, and § is 
the. angular acceleration. We can express the dynamics of this robot in discrete time 
as 

8 k - 2#*-i + fa-* = — fa + tjt_,) (6. IB J 

where h is the sampling period (Astrorn and Wittenmark, 1984), Note that for 
later iiotitional convenience in inverting the plant the sampled torques have been 
renumbered to remove the plant delay. A sampling frequency of lkHz Is used in the 
numerical simulations to follow. 
We nee a feedback controller 

to give the second order pianL a natural frequency of lOHl arid a damping ratio 
of 0.707. The angle and the angular velocity of the single joint are both measured 
directly. 
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Deriving the convergence criteria 



Since the plant is linear the inverse plant model used for feedforward command 
modification will be linear > and it can therefore operate directly on the trajectory 
error, e[t\. The example plant is also minimum phase and the plant inverse will he 
causal- In order to put convergence bounds on the learning operator we will refer 
to a general Learning operator L instead of requiring the learning operator to be 
the inverse of the robot dynamks 1 R' 1 . L will be restricted to be a causal linear 
operator. 

The propagation of feedforward command errors from one movement to the next 
is given by subtracting the correct command, U* p from bath aides of Equation (6.11) : 

*uj} l [(] = Ai'[t]-&'[t| 

= ffii*[tj-*fl] + e(*l 

where i\t] is the impulse response of the learning operator L and + is the convo- 
lution operator. Since the learning interval (the duration over which we modify a 
particular feedforward command "j is finite and includes only T samples, we can ex- 
press Equation (6.20) aa a matrix equation by replacing convolutions with matrix 
operations. 

The sequence of sampled command errors and feedforward command error a are 
expressed as vectors, and the transformation between them can be expressed as a 
matrix equation £\i' — A^uy^: 



[6.20] 






a[0] 

a[l] t[0] 












\ *<m ) 



(6.21] 



\u\T\ *[r-i| <■■ a[o]J 

where A is a lower triangular Toeplstz matrix. The elements at\ are the impulse 
response of the transfer function relating feedforward command errors to actuator 
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command errors, 1/(1 + CfaJJJf?]), and we note that <i|0] must at ways be I dtae to 
a minimum loop delay of one sample before the feedback command can compensate 
for feedforward command errors. 

The trajectory errors are generated by feedforward command errors according to 
the matrix equation e* = BJ u\ 
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yb[T\ b\T-i.\ . 

where b[t\ is the impulse response corresponding to the transfer function i*[s|/[l H 

The effect of the learning operator can be expressed as a lower Lriangular Toeplitl 
matrix, L, using the matrix equation W = LeV 

{ tfu>] \ h[o\ o ... o ^| / c*|oj 

fu\l] f[l] l\) ..* 



«P(1] 



(6.23] 



{fu\T\} [tfi\ l\T-l\ -■ tp\ ) 

Combining the previous matrix equations. Equation 6.20 can be written as the 
matrix equation: 

Testing for convergence of the feedforward command error to iero reduces to 
guaranteeing that t3lt absolute values of all the cifi.cn raises of the matrix (A LB) 
are less than 1, Since A H L, and B are at] lower triangular Toeplltz matrices, the 
eigenvalues arc a:l equal to the diagonal element of (A- LB), a|0| -i[0|6[0]. We recall 
that a|0j - 1, and note that the test for convergence in the finite learning interval 
case reduces to a test involving only the first element of the learning operator impulse 



125 



response and. the controlled plant impulse response:: 

|i-*[oj&fo]!<i [e.2s) 

This convergence teat does not require L to be a Very accurate inverse plant model. 

A numerical example 

To demonitrate that the above convergence condition does not guarantee accept 
able performance let us example a particular learning operator which satisfies the 
convergence test but generates bad performance. Let lis choose a learning, operator 
that satisfies the finite learning interval convergence test exactly euch that 

1 - l\&,ya\ = < 1 [8.26] 

Such a learning operator is given by 

I [*1 = 



m [6.27) 

otherwise 



With this learning operator the nigcnvalucs of {A - LB) are all sctOj and we are 
guaranteed exact convergence in at most T movements, where T is the number of 
samples in the learning interval. This learning operator corresponds to correcting 
the feedforward command with a scaled version of the position error history. 

To examine the transient response of this learning operator we resort to nu- 
merical simulation. We initialize the feedforward command to have a single error 
on the first sample. The resulting position error is shown on the first graph of 
Figure 6,(5- Several of the following movements arc also shown, and by the 10th 
movement the position error has increased by a factor of 10 s *. It is difficult to see 
that with each movement the first non-zero position error of the previous movement 
is exactly cancelled, as the position errors later during the: movement are blowing up 
exponentially. This exponentially growing wave of errors does shift forward in time, 
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and after T movements will have shifted completely outside the learning interval. 
Had the correct inverse dynamite model been used as the learning operator [under 
these ideal conditions of no measurement noise, actuator noise, or disturbances) the 
feedforward command error" would have been entirely cancelled after one movement. 
This type of finite time interval convergence relies on the absence of any intro- 
duction of new error?. In the presence of sensor and actuator noise and plant distur- 
bance*, the convergence promised by the finite time interval analyses will probably not 
occur. In addition!, as the sampling interval length changes in Che discretization of a 
continuous time plant, the convergence properties of this type of learning algorithm, 
change. Convergence is improved when the plant is sampled at low frequencies, 

A performance test 

The previous ejcample demonBtrateH the need to test performance in addition to 
showing convergence of a teaming algorithm, In the nonlinear caae performance 
must usually be checked by simulation and by actual implementation, but with 
linear plants and feedback controllers we can take advantage of superposition and 
the abiiity to transform to the frequency domain to check how errors at all frequencies 
are affected by the learning algorithm. From Equation 6,12 or 6.20 we &cc that the 
errors are propagated according to 

fllJt!« 



where the argument z indicates that we have shifted to the frequency domain using 
the Z transform. With a perfect inverse model, (L = i? -1 ), A [a] is- zero, implying 
convergence of the feedforward command error to zero after one practice movement, 
The learning algorithm will perform badly if at any frequency the gain of A [s] is 
greater than 1. The gain of A[iJ as a function of frequency can be used to evaluate 
the efficacy of proposed learning operators. In the numerical example showing con- 
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vcrgence with poor performance, the poor performance wa* indicated by a maximum 
gain of A[i] equal to 530. This is approximately the factor by which the maximum 
position error was growing 0:1 each movement repetition in lilt sinv.]la:ir>-, 

6.7 Conclusions 

The contributions of this work arc: 

• The derivation of a nonlinear trajectory Learning algorithm baaed on explicit 
modelling of the plant. 

• The demonstration of Eoorl performance of the learning algorithm by imple- 
mentation on the MIT Serial Link Direct Drive Arm. 

■ The demonstration by a simple example that proofs of convergence do not 
Guarantee acceptable performance and of the need to verify performance. 

We conclude that more accurate robot models improve trajectory learning perfor- 
mance, and learning algorithms do not reduce tin.- n-cud for good models in robot 
control. 
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Chapter 7 



Future Research 



There axe many components of motor learning^ and the work in this thesis is only 
a small step in a research program in motor learning. Some of the Future research 
which is very closely related to the research presented in the thesis is discussed in 
the previous chapters. Other topics *re mentioned hire, 

7*1 Local models 

The single trajectory learning work can be ejtteftdcd to learning a group of similar 
trajectories. This involves buiSding a representation ofthe dynamics in a Email region 
of the augmented state space (the state variables and their derivatives], which I refer 
to as a local model. The local model could be a representation of the linearized 
dynamics about one of the desired trajectories in the group, or it could be a tabular 
representation. This local modelling would be a correction model to the previously 
identified global model. Key issues here are Etarag.e of all the Information necessary 
to represent the fine details, of the local dynamics, and deciding whether to pool 
information from different tasks or store experience according to the type of task. 
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7.2 Task level learning 

This thesis addressed only isolated learning modules. It is important to e.Tramine 
learning a complete task in addition to focussing on modules, as there will be many 
learning modules acting simultaneously in any realistic motor problem, and exploring 
their internet-ion is important. It is important to focus on other kinds of tasks where 
the effector system must respond to external demands rather than follow a given 
plan. 

Often task performance can be represented by a small number of parameters, 
and the execution of an arbitrary trajectory is not required,. Reducing the degrees 
of freedom of possible motor performance greatly simplifies the learning problem, 
and reduces the amount of knowledge necessary to learn from practice, 

7.3 Plan optimization 

The work in this- thesis has; focussed on improving execution of a, given plan. The 
perfect execution of a Hawed plan will not lead to optimal performance. Plans can 
be improved on the basis of models derived From experience. 

7.4 Object perception 

Rigid body load identification is a first step towards general object perception on 
the basis of force/torque information, There are many types of loads that are not 
well modelled by the rigid body dynamics framework, and alternative approaches to 
Identification and control Of these loads should he explored, 
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7.5 Human motor learning psychophysics 

This work suggest* a program of research on the quantitative psycbophysics of motor 
learning. My previous research, not discussed in this thesis, on biological movement 
kinematics, dynamics, and contra] provides i strong base for planned studies on 
biological motor learning. Findings relevant to motor learning include an invariance 
of the tangential velocity profile across many different movement conditions;, sepa- 
rability and scaling of load dynamics, and sea-ling of control gains with movement 
speed. In the immediate future my biological research will focus on two areas: 

A Comptttntt model; My own and other theoretical work have demonstrated 
possible roles of internal dynamic models in control and learning, and provided 
mathematical bounds on the required accuracy of these models, The goal of this 
research is to estimate, using simulations, how good human interna] models actually 
are. However, in order for these estimates to be meaningful we need Lo have accurate 
estimates of the mechanical properties of the arm and the effective feedback gains 
during, movement. I have begun this research by examining elboW movement t and 
discovered both modulation of effective feedback gainH during movement and scaling 
of effective feedback gains for movements made at different speeds. Theae measure- 
ments need to be extended to multi-joint movement, and then useful simulations can 
be undertaken. 

Motor foaming psyenopft$wi'(:s: Theoretical analysis and implementation on robots 
of different trajectory learning algorithms has revealed distinctive pattern* of tra- 
jectory modification for alternative algorithms. In order to test for the possible use 
of any of theEe algorithms by humans we need to quantitatively measure patterns of 
trajectory modification during learning of particular trajectories. 
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7.6 Concluding note 

This thesis provides a particular view on what intelligence is: identifying models 
of both the environment and the en-Ran ism itself, and using the identified models 
to predict, plan, and control action. Attempting to build intelligent machines that 
achieve or surpass human levels of performance tests our understanding of these 
ideas, and how complete this View of intelligence re&[]y 18. 
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Appendix A: Using The Identified Arm Model 
For Control 
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Experimental Evaluation of Feedforward and Computed Torque Control 

CH. An, C.G- Atfcewtt, J.D, Griffiths, and J.M, Hoiterbach 

MIT Artifical Intelligence Laboratory 
Cambridge, Mass. 02139 USA. 

Abstract. Trajectory tracking errors resulting from the Application of various con- 
trollers have been experimentally determined on the MIT Serial Lint Direct Drive' Arm. 
The controllers range from simple analog PD control applied independently at each 
joint to feedforward and computed torque methods incorporating foil dynamics. It was 
found that trajectory tracking errors decreased as more dynamic compensation terms 
were incorporated. There was no significant difference in trajectory traclring perfor- 
mace between the feedforward controller using independent digital servos and the fult 
computed torque controller. 

1 Introduction 

Despite voluminous publications on the control of robot arms, there have been few 
experimental evaluations of the performance of proposed controllers, A major reason for 

this ii the lack of a suitable manipulator for testing that fits these publications 1 modeling 
assumptions. Commercial robota are characteriied by high gear ratios,, substantial joint 
friction, and slow movement. As a result, their dynamics are approximated well by 
single-joint dynamics (Goer, 19B5-; Good, Sweet, and Strobel T 19S5), Moreover hardly 
any commercial robots allow the control of joint torque i which is required In many of 
the proposed controllers. 

Direct drive arms are increasingly overcoming some of the performance limitations of 
highly geared robots- (Asada and Youcef-TDiimJ. l$8 4; Curran and Mayer, 19fl5j Kuwa- 
hara et a|. T 1^5], The manipulator dynamics are made more ideal by the reduction of 
joint friction and backlash effects,, and the control of joint torques becomes more feasi- 
ble. Hence direct drive arms have the potential for serving as good experimental devices 
for testing advanced arm control strategies. When gearing is eliminated, however, the 
full nonlinear dynamic interactions between moving links are manifested. 

This paper reports on two seta of experiments with the MIT Serial Link Direct Drive 
Arm (SLDDA) involving a subset of proposed control strategies. The first set of ex- 
periments is based on a hybrid control system. There is an independent analog servo 
for each joint with the position and velocity references, and feedforward commands 
generated by a microprocessor, Since moat commercial arms are controlled by simple 
independent PID controller far each joint, an independent PD controller was tested on 
this arm to provide a baseline far comparison. The PD controller was augmented by 
Feeding forward first gravity compensation and then the complete rigid body dynamics 
to ascertain any trajectory following improvements attained by taking the nonlinear 
dynamics more fully into account. The second set of experiments shows the preliminary 
results of digital servo implementation, using one Motorola 68000 based mkroprocea- 
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5or to control all the joints of the SLDDA. The on-line computed torque approach is 

compared to the PD tail to the feedforward approach** using the digital Mrvo. 

The accuracy of the manipulator dynamic model impinges on the performance of 
feedforward and computed torque controL Since fries ion is negligible for direct drive 
arms, and presuming that one ha« good control of joint torques, the Issue of accuracy 
reduces to how well the Inertia] parameters of the rigid links are known. In our previous 
work, we dsveloped an algorithm for estimating these inertial parameters for any multi- 
link robot as a result of movement r and applied It to the SLDDA (An, Atkeson, and 
Hallerbach, IMS}. The present paper presents results of utilising, the estimated model 
to control the robot by both off-line (feedforward) and on-line (computed torque} com- 
putation of the joint torques. 

1.1 Contra] Algorithm* 

The full dynamics of an n degree-of- freedom manipulator axe described by 

n = J(q)q + V(q,q) + G(q) + F(q,q) [1] 

where n is the vector of joint torque* (for rotational joints) , q is the vector of joint 
angles, J is the inertia matrix, V is the vector of eoriolii and centripetal tena** <* it 
th* gravity vector, and F 1* a vector of friction terms- The simplest and most common 
form of robot control is independent joint PD control^ described by 

n = K,(q4 - q) + K r (n t - q) (2) 

where qj and q* are the desired joint velocities and positions, and K r and K v are n > n 
diagonal matricea of position and velocity gains. 

Feedforward control augments the basic PD controller by providing a sat of nominal 
torques tt//; 

n// [q j. q* hi) = $ M Q J + V( q^, qj) + G(q j) + £ (q*, q j) (3) 

where the hat (") refers to the modelled values. When this equation is combined with 
Mi *"« feedforward controller result*: 

n = a/j (<Up **, ^) + K. ft* - q) - K, [q t - q) (4) 

The feedforward term n f f can be thought of as a set of nominal torque* which Linearize 
the dynamic* (l) about the operating points qj, qj, and q Jr Therefore s it is reasonable 
to add the linear feedback terms K M (q rf -q) +K,(q*-q) as the control for the linearised 
system. These feedforward terms can be computed off-Lin^ since they are function of 
the parameters of the desired trajectory only. 

On the other hand T the computed torque controller compute* the dynamics on-line h 
using the sampled joint position and velocity data. The control equation is: 

tt*(q*» * Qd» Q. 4.) = h*)*' + V{q, q) + G(q1 -f F(q,q) (5) 
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where q* is given by H 

q"=q+K u (qj-^J +K fl fqj-q). (6) 

ir the robot model is exact, then each link of the robot Is decoupled, and the trajectory 
error goes to Mto. Gilbert and Ha (1984) have shown that the computed torque contra] 
method is robust to ama]l modelling errors, 

Previously, Liegeois, Foumler, and Aldon fl&&0) suggested feed forward control as 
an alternative to the on-line- computation requirements of computed torque control, al- 
though they did not prevent any experimental results. Colli, Car;, and Hughes (1931] 
discussed different linear state-feedback controller using a linearized model of a ma- 
nipulator. Asada 1 Kanade, and Talceyama (19G3) presented some results of applying a 
feedforward control to the early version of a direct drive ana at the Robotics Institute 
of CMTL", though for quite slow movements and for inert Lai parameters, derived by CAD 
modeling. The computed torque method has been considered by severaJ other inves- 
tigators (Paul, 1972; Markiewics, 1973 j Bejcsy, 1974; Lun, Walker, and Paul, 1980). 
Although simulation results have been presented, there has been no published report on 
the actual implementation of this controller, mainly due to the lack of an appropriate 
manipulator or on-line computational facility. 

In this paper, we first use the feedforward controller to evaluate the accuracy of 
our estimates of the link inertial parameters, and to compare its performance against 
several other simpler control methods for high speed movements. Then,, we present 
some preliminary results on the implementation of a computed torque controller, again 
using the estimated Inert! al parameters of the links, 

1+3 Estimates of inertlsi parameters 

The inertial parameters In the feedforward computation for the SLDDA were estimated 

by an algorithm developed previously (An, Atkeson, and Holler bach, 1984} , It was 
shown that the unknown inertial parameters of each link (mass, center of mass, and 
momenta of inertia) appear Linearly in the rigid body dynamics of a manipulator. Then 
a least squares algorithm was used to compute the estimates of these parameters. 

The accuracy of the inertial parameters was verified initially by comparing the mea- 
sured joint torques to the torques computed from the estimated parameters. This 
comparison, together with the torques computed from parameters derived by CAB 
modelling, is shown is Figure 1 for a 1-3* trajectory Df all the joints moving 250 de- 
grees. The results were actually superior for the dynamically estimated parameters 
than for the C AD-modelled parameters, A more practical verification of the estimated 
parameters is in generating feedforward torques as part of a control algorithm. The 
results of such experiments are presented in the next section. 

3 Robot Controller Experiments 

In this section, performances of several different controllers for full motion of the SLDDA 
are eval uated using the hybrid controller- The reference positions and velocities, and the 
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Figure V. The measured, CAD-modelled, ud estimated joint torques* 

feedforward torque* are generated by a microprocessor and input to three independent 

joint analog servos, We present evaluations of the following five control method* used 

for high speed movements of ail three joint* of the manipulator: 

1. PD controller with, position reference only: 

2 r FD controller with position reference and feedforward of gravity torques; 

n - Gfat) - K*q -4- K,(q, - q) 
3. PD controller with ponition and velocity reference!: 

n=K*(q*-q)+K,(qj^q) 

4> PD controller -*ith position and velocity references plus feedforward of gravity 
torque*; 



n = £{<uj + K,[qj - 4) + K P (q, - q) 
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Figure 2: Trajectory erron of the 5 controllers for full 1-3* motion. 

5, PD controllsr with position and velocity references plus feedforward of fuJ] dy- 
namics; 

a = J((u)q4 +■ V{*li,4i) + &(q*} + K,[qj - ^) + K,(q, - q) 

In these experiments, friction was neglected. The nominal position and velocity gains 
were adjusted experimentally to achieve high stiffness and overdamped charateriatica 
without the feedforward terms. 

A fifth order polynomial in joint apace was used to generate the reference trajec- 
tory. The joints moved from (fl(T h 269.1" t -30'] to (330* , 1 9.1*, £20*) in 1.3s, with peak 
velocities of 360 dep/j« and the peak accelerations of &54 dtgf&te 1 For each joint. For 
control methods (2) h (4), and [&) t the estimates of the [ink inertia! parameters given by 
An, Atkecon, and Holler bach {!&&&) were uaed to compute the feedforward torques. 

The trajectory errors for the above 5- controllers are shown on Figure 2. The errors 
for the hist controller are very larje and are nut of the graph range. Adding a. gravity 
feedforward term does not help very tnucti, and the trajectory errors for Controller 2 
are also very Large. This was expected since gravity feedforward is a static correction to 
Controller 1, and the dynamic effects dominate the response for high speed movements, 
Modifying the first controller by adding a velocity reference aignal improved the response 
greatly, Aa with Controller 2, adding a gravity feedforward term did not reduce the 
trajectory errors very ranch, and influenced mainly the fltaady state errors for joints 2 
and 3. 
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Figure 3: Trajectory error* of th* this* digital control leis for frill 1,3* motion- 

The full feedforward controller reduced the trajectory error* significantly for joints 1 
and 2 t with peak errors of only 0,33* and 0.64" h respectively. For joint 3, the feedforward 
torques did not Kelp because: of th* light inertia and the dominance of urunodelled 
dynamic* in th* motor and in bearing friction. The hjgh feedback gain* make this joint 
somewhat robust to then unmodeiled dynamic*; yet t the trajectory errors could not be 
reduced below 1,4* with the feedforward torque* bated on the ideal rigid body model 
of the link, 



3.1 Computed Torque ContFoilnr Experiment 

In this section, some preliminary result* are presented for the computed torque method 

implemented on the SLDDA. In this Implementation, the analog servo* are disabled, 
and the feedback computation is done digitally by one Motorola 68000 baaed micropro- 
cessor u*tng scaled fixed-point arithmetic. Written In the C language, the controller, 
including the full computation of the robot dy amies, runs at a 133 He sampling fre- 
quency. Although further improvements in computation time are possible, this speed 
wa» adequate in demonstrating the efficacy of dynamic compensation. The details of 
this implementation are discussed in (Griffiths, L98&). 

A similar fifth order polynomial trajectory a* in th* previous section was used for 
this experiment. Figure 3 show* the trajectory errors for three controller*: the digital 
PD controller, the feedforward controller using a digital servp, and the computed torque 
controller. The computed torque and the feedforward controllers both show a significant 



reduction in tracking error* for joints l and 2 compared with the PD control, with, no 
clear distinction between feedforward and computed torque. The tracking errors far 
joint 1 range from 4.4" to 2. T and for joint 2 go from 3-S fl to 2.0° with the addition of 
dynamic component. As before h the trajectory errors for joint 3 were not reduced by 
the computed torque or the feedforward controller, Again, this seems to indicate that 
our model for the third link may not be very good. 

The trajectory tracking performance of the computed torque controller is not as 
good aa that of the analog feedforward controller of the previous section. The main 
reason for this is the alow sampling frequency [133 Ha) of the digital controller, as 
compared to the 1 K£Le sampling frequency at which the reference input* were given 
to the analog servos. Improvements in the computation time should also improve the 
tracking: performance of the computed torque controller, 

3 Conclusion* 

We have presented experimental results of using an estimated dynamic model of the 
manipulator for dynamic compensation via feedforward and computed torque control 
methods. The results indicate that dynamic compensation can improve trajectory accu- 
racy significantly and that the estimated rigid body model of the manipulator is quite 
accurate and adequate for control purposes for joints 1 and 2. The umnodelled dynam- 
ics of the light third link, including the motor dynamics and friction, are dominant and 
yield Larger trajectory errors than at the other two joints. Therefore, for joint 3, it may 
be necessary to use a mors complete model to improve trajectory following. 

The results of the digital implementation of the feedforward and computed torque 
controllers were not as good as the hybrid feedforward controller. This indicates that 
if a robot was being used solely for free space movements without significant variation 
of its Loads t then a hybrid controller using an independent analog servo for each joint 
may be quite adequate, A hybrid controller, however. Is not flexible, and cannot handle 
varying loads or interactions with the environment. Future experiments with the MIT 
Serial Link Direct Drive Arm will concentrate on improving the computation time for 
the digital control system and on issues of force control. 

Acknowledgments 

This paper describes! research done at the Artificial Intelligence Laboratory of the Mas- 
sachusetts Institute of Technology. Support for the laboratory's artificial intelligence 
research la provided in part by the Defense Advanced Research Projects Agency un- 
der Office of Naval Research contracts N0O0l4r-8O-C-O5O and N0O014-82-K-Q334 and 
by the Systems Development Foundation. Partial support for CGA was provided by 
a Whitaker Fund Graduate Fellowship and for JMH by an NSF Presidential Young 
Investigator Award. 



141 



Reference* 

An, C-H s Atkeson, C G + and Hollcrbach, J NL, 1985, "Estimation of inertia! parameters 
of rigid body links of manipu]ators > ' T fVoe. £ft& Conf. Decision and Control For* 
Lauderdale, Florida, Dec. H-13- 

Asada, H., Kansde, K-, and Takeyama, L, 1983, "Control of & direct-drive inn, 1 * TVafw. 
tf/ASM£, 105, pp. 136-142, 

Aa»da, H., Youcef-Toumi , K< t and Lim, 5.K., 1984, "Joint torque measurement of a 
direct-drive arm," P™. £Srd Conf. Decision and Control, Lai Vegas, Dec. 12-14, pp. 
1332-1337. 

Bejcay. A.K., Tain, T,J„ and Chen, Y.L. t 1985, "Robot arm dynamic control by com- 
puter," Proc. IEEE Conf. Robotiei snd Automation* St. Louia, Mar. 25-28, pp. 
060-970. 

Curran, R-, and Mayer, G., 1985, "The architecture of the AdeptDfle dlreet-drive 
robot," Proc. American Control Conf., Boston, June 19-21, pp. 716-731, 
Gilbert, E,G +l and Ha, LJ., 1984, "An Approach to Nonlinear Feedback Control with 
Applications to Robotici," IEEE Trans. Syittm*, Man, Cyi*™., SMC- 14, pp. 879-884. 
GoUa, ELF., Garj T S-C*, and Hughe*, P,C„ IQ61* "Linear-state feedback control of 
manipulators," Math. Machine IWry, W, pp. 93-103. 

Good, M.C-. Sweet, L.M., and Stxtifal* K.L. f 19«o, "Dynamic models for control system 
design of integrated robot and drive ayetems," ASMS J. Dynamie Syittm*, Mfa*., 
Control, 107, pp. 53-59, 

Goor, R,M., 1988, *A new approach to robot control," Proc. American Control Conf, 
Boston, June 10~21 t pp. 3S5-38& 

Griffiths, John D. T 1980, Experimental Evaluation of Computed Torque Control, B.5. 
Thesis* MIT, Mechanical Eng.. 

Kuwahara, H., Oflo, Y,, Nikaldo, M„ and Mataumotcs T„ 10S5, "A precision direct- 
drive robot arW Pro*. American Control Con/., Boston, June 19-21, pp. 722-727. 
Liegeois, A,. Fgumier, A. and Aldon, M., 1980, "Model Reference Control of High- 
Velocity Industrial Robots," Pro*. Joint Automatic Control Conf., San Francisco, CA, 
August 13-15. 

Luh, J.Y,S., Walker, M.W., and Paul, R.F.C., 1980, "Resolved-Acceleration Control of 
Mechanical Manipulators,'' IEEE Tram. Auto. Contr. A AC-25 (3), pp. 4oo-474 t 
MarkiewicE, B., 1973 T "Analysis of the Computed Torque Drive Method and Compar- 
ison With Conventional Position Servo for a Computer-Controlled Manipulator," Jet 
Propulsion Laboratory, Pasadena, CA, March IS, 

Paul, ft. C, Sept, 1972, "Modeling^rajectory calculation and wrvoing of a counter 
controlled arm," AIM- 177, Stanford University Artificial Intelligence Laboratory. 



142 



References 



Albus, J. a, (1975a) *A new approach to imiiipulator control: the 

cerebellar mode?] articulation controller (CMACJ.* J. Dynamic Svitems, 
Measurement, and Control fit: 330-237 

Alblls, J. S. (1975b) "Data storage in the cerebellar model articulation 

controller (CMACj," J. Dynamic Systems* Mtamrtmcnt, and Control 07; 22H-233 

An, C, H M Atkeson, C. G. and Hollerbach, J. M. (1985) 

December it- 13 "Estimation or Inertia] Parameters of Ri K id Body Links of 
Manipulators^ Proe S^th Can/, Decision and Control, Fort Lauderdale, Florida 

Anand, D. K. (1984) Introduction to Control Systems. 2nd Edition New 
Yurk,. Pergamon Press 

Anderson, B* D. O. and R, M. Johnstone (1981) "Convergent 

Results for Widrow's Adaptive Controller.™ IFAC Canf, on System Identification 

Arimoto, S. (198B) May "Mathematical Theory of Learning With 

Applications to Robot Control. 1 * Froc. of jih Yale Workshop on Applications of 
Adaptive System* Theory Center for Systems. Science, Yale University, pp 215-220 

Arimoto! S*, S. Kawamura, and F. Miy&zaki (T984a) 

"Pettering Operation of Robots by Learning," J. of Robotic Systems 1; (2) 123-1 40 

Arimoto, S., S, Kawamura, and R Miyazaki (1984b) August 

"Can Mechanic at Robots Learn by Themselves? * Proc. of 2nd Inter. Symp, 
Robotics Restored Kyoto, Japan 

Arimoto, S», S. Kawamura, and F. Miyazaki ( 1984c) Dec 

"Bettering Operation oF Dynamic Syrtems Py Learning; A New Control Theory 
For Servomeehanisrns of Meehatronics Systems," Proc. 2Srd IEEE CDC, Las 
Vegas k Nevada 

Arlmotpj S M S< Kawamura, F. Mjyazaki, and S. Tamaki 
(1985) Dec. 11-13 "Learning Control Theory for Dynamical System*.* Proe. 



143 



&4th Conf. uft Decision and Control, Fort Lauderdale,, Florida 

Asada, H., Kanade, K* f and Takeyama, I. (19B3) ^Contra] of a 

Direct-Drive Arm." Trans, of ASMS 105l 136-142 

Aaada, H. t and Youeef-Toumi, K« (1984) * Analysis and design of a 
direct-drive arm with a live- bar- link parallel drive mechanism. 1 * ASME J. Dynamic 
Systems, Mew-, Control 1Q&; 225-230 

Asada, H M Youcef-Toumi, K., and Lim, S.K. (1984) Dec. 12-14 

"Joint torque measurement of a direct-drive arm," Pr&c, &$rd Conf r Decision and 
Control, La& Vegas, pp 1332-1337 

Afltrom, Karl J, (1983) ^Theory and Applications of Adaptive Control — 
A Survey/ Automatic 10: (5 J 471-486 

As tram, Karl J* and Wittenmark, Bjorn (1984) Computer 

Controlled Systems: Theory and Design. Englewood Cliffs> N.J.j Prentice-Hall, Inc. 

Atkeson, C-C, An, C.H., and Hollorbach, J,M. (1985a) Dec. 

11-13 ^lligid body load identification for manipulatora.'" Pror.. S^th Conf, Dee-won 
and Control t Fdrt Lauderdale, Florida 

Atkeson, C. G., An, £L IL, and Hollerbach, J* M. (1985b) 

Oct, "Estimation of Tnertial Parameters of Manipulator Loads and Links. 1 * Proc. 
3rd Int. Symp. Robotics Restoreh, Gouvieux (Chantilly), France, pp 32-33 

C. Atkeson and J* Mclntyre (1980) Juiy 1-3 "Applications of 

Adaptive Feedforward Contra] in Robotics.* 1 Proceedings. §nd IFAC Workshop an 
Adaptive Systems m Control and Signed Processing, Lund, Sweden 

Bejczy, A, K* (1974) "Robot Arm Dynamics, and Control," Technical 
Memorandum 33-669, Jet Fropulsiim Labaratury^ Fasadecta^ CA. 

Bernard, J + W. and L Lefkowitz (1962) "An Approach to 

Optimising Control Ea&ed on a Generalized Dynamic Model." Joint Automatic 
Control Conference, pp 3-2, 1-9 

Boniveilto, C, and R. Guidorzi (1971) Aug. 11-13 ''Parametric 

Identification of Linear Mu Invariable Systems," Free isth Joint Automatic 
Control Conference! St, Louis,, Missouri, pp 342-349 

Brady, Michael, Hollerbach, John M M Johnson, Timothy L», 
Lo^ano-Pprez, Tomas, and Mason, Matthew T* (1962) Robot 
Motion: Planning and Control, Cambridge, MA,, The MIT Press 

Bristol, E. H* (1967) "A Simple Adaptive System For Industrial Control/ 

144 



Instrumentation Technology 14: (6] 

Casalino, G, and L. Gambardella (1986a) April 7-10 "Learning of 

Movements in Robotic Manlpulatora." Proc. 1986 IEEE International Conference 
on Robotic* and Automation, San Francisco,, CA + pp 573-578 

Coiffet, P. (1983) Robot Technology: Interaction With The EnvinMUB™*. 
Englewood Cliffs, NJ, Prentice-Hail, Inc. 

Craig, J. J. (1084) June6-S "Adaptive Control of Manipulators Through 
Repeated Trialg." Proa. American Control Conference* San Diego, pp I5W-1574 

Craig, J. J, (1983) Adaptive Control of Manipulatory PhD. Thesis 

Proposal, Computer Science Department, Stanford University, April. 

D'Azzo, John J, and Houpis, Constantine H. (1966) Feedback 
Control System Analysts and Synthesis. New York, N,Y M McGraw-Hill Book 
Company 

Da vies, W. D, T« (1970) System Identification for Self- Adaptive Central, 

New York, Witey-Intersclence 

Doebelin* Em eat O. (1962) Dynamics Analysis and Feedback Control 
New York, Mcgraw-filll Bool: Company 

Eligabe, G. E. and G. R. Meira (1083) "A daptiv* Control with 
Feedforward Compensation and Reduced Reference Model." In: Landau, [ r D., M. 

Tomizuka, and D. M. Auslandcr, Edg.(eds) Adaptive Systems in Control and 
Signal Processing 1983; Proceeding* of Ike JFAC Workshop. New York, Pergamon 
Press, pp 79-BO 

Evdeigh, Virgil W. (1972) Introduction to Control Systems Design, New 
York, Mcgraw-Hill Book Company 

Franklin, Gene F. and Powell, J t David (1980) Digital Control of 
Dynamic Systems. Reading, MA,, Addison- Wesley Publishing Company 

Furuta, K* and M« Yamakita (1980a) April 7-10 iterative 
Generation of Optimal Input of a Manipulator." Proc. J98G IEEE International 
Conference on Robotif.n and Automation^ San Francisco, CA, pp 579-584 

Gelb, A., ed. (1974) Applied Optimal Estimation, Cambridge, MA, The 

MIT Press 

Gerstenbcrger, Michael Duane (1985) May * Manipulator Control 
Using Recursive Dynamics Computation." S. M. and E. F. Thesis, Mas^achusel ts 
Institute of Technology, Department of Electrical Engineering and Computer 

14S 



Science 

Gilbert, E.G., ^ud ila, I.J. (1 984 ) "An approach to nonlinear 
feedback control with applications to robotics, 11 IEEE Trans, Sy&ttms, Man t 
Cybtrn. SMC -14: 879-884 

Go] la, D.F., Gargj S.C., and Hughes, F«C* (1981) "Linear-Bute 

feedback con'.rol of iiliinLyulatort.'* Mich* Machine Theory 16: 93-103 

Go] ub, G-H.j and Van Loan, CF. (1983) Matrix computations,, 
John Hopkins University Press 

Good, M.C., Sweet, L,M + , and Strobel, K.L* (1985) "Dynamic 

modfils for control system design of Integrated robot and drive systems.* ASME J. 
Dynamic Systems, hftas., Control, IflY: 53-59 

Goodwin, Graham C and Kwai Sang Sin (1984) Adaptive 
Filtering, Prediction, and Contra]. Englewood Cliffs, NJ,, Prentice-Hall, Int. 

Goor, R.M- (1985a) June 19-31 "A new approach, to robot control." Pro*.. 
American Control Conf t Boston,, pp 385-3-EI9 

Goor s R* M + (1986b) Nov\ 17-22 "A New Approach to Minimum Time 
Robot Control. 7 ' Winter Annvoi Meeting &f ASME, Miami Beach, FL., pp 1-11 

Grabbe, Eugene M + , Simon Ramo, and Dean E. Wooldridge, 

Eds< (1961) Handbook of Automation, Computation, and Control, Volume 3: 

Systems and Components. Los Angeles, CA,, Thompson Ramo WooldridEc-, Inc. 

Graham, Robert E. (1946) "Linear Servo Theory." The Bell System 
Technical Jav-rnaJ %fc (4)616Hwl 

Groajean, A*, and C. Foulard (1976) 21-27 September, 1976 

[published in 1978, by North Holland PubL, New York) "Extensions of Levin's 
Method (or Eigenvector Method) For the Identification of Discrete, Linear, 
Multivariable, Stochastic, Time Invariant, Dynamic Systems." Identification and 
System Parameter Estimation, Proc. of Jtk JFAC $ymp n pp 2003-3010 

Hamming, R. W, (19T7) Digital Filters. Englewmd Cliffs, NJ, 

Prentice-Hall, Inc. 

Hara, S*, T* Omata, and M. Nakano (1985) Dec. 11-13 "Synthesis 

of Repetitive Control Systems and its Application. 71 Proc. Sjth Con/, on Decision 
and Control, Fort Landerdale, Florida 

HarokopoS, E* G. (l98fia) April 7-10 "Optimal Learning Control of 

Mechanical Manipulators in Repetitive Motions. 11 Proc. l$$& TE BE International 



116 



Conference on Robotics and Automation, San Francisco, CA p pp 396-401 

HarokopDS, E. G. (1986b) April 30-24 "Learning and Optimal Control of 
Industrial Robots in Repetitive Motions, n Pros. Robots 10 y Chicago, Illinois, pp 
4_27 _ 4_40 

Harris, C. J. and S. A, Billings, Bdsu (1981) Se!f-Tunin K and 
Adaptive Control; Theory and Applications, New York, Peter Peregrinua, Ltd. 

Hildreth, Ellen C and John M* Hollerbach (1985) "The 

Computational Approach to Vision and Motor ControL" AIM 846 K Artificial 
Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge MA. 

Hirzinger, G. and K. Landzettel (1985) March ^2S "Sensory 
Feedback Structures for Robots With Supervised Learning" Proe. of 1&S5 IEEE 
Int. Conf, on Robotits and Automation* St. Lauia, Missouri 

Hogartj Neville (1984) impedance Control of Industrial Robots," 
Robotics and Computer* Integrated Manufacturing 1: (l] 97-1 13 

Hogan, N. (IQRSa) ''Impedance control: An approach to manipulation: Part 
1- Theory." ASMS J. of Dynamic Systems, Measurement, and Control 107: 1-7 

Hog an, N* (1985b) ^Impedance control; An approach to manipulation; 
Part n - Implementation/ ASHE J. of Dynamic Systems, Measurement, and 
ConiroMQT: S-16 

Hogan, N. (1985c) "Impedance control: An approach to manipulation: 
Part ITT- Applications." ASME J, of Dynamic Systtrtis, Mtaauremtntj and 
Control 107: 17-24 

Hollerbach, J T M. (1984) "Dynamic Scalin K of Manipulator Trajectories." 
Journal of Dynamics Systems, Measurement, and Control 106; 102-1 ye 

Hollerbach, J. M., and Sahar> G. (198$) "Wrist-partitioned Inverse 
kinematic accelerations and manipulator dynamics. 1 * Int. J, Robotics Research 3- 
(4)61-70 

Hollars, Michael G. and Robert H, Cannon, Jr. (198 S) Nov. 

17-22 "Initial Experiments on the End-Point Control or a Two-Link Manipulator 
with Flexible Tendons/ ASMS Winter Annual Meeting, Miami Beach 

Horowitz, Isaac M> (1963) Synthi.*™ of Feedback Systems, New York, 
Academic Press 

Isaacson, Eugene and Herbert Bishop Keller (1966) Analysis 

of Numerical Methods, New York, John Wiley and Sons 



nr 



Isermanil, Rolf (1981) Digital Control Sy sterna. New York, Spring cr Verlag 

Igemiann, Rolf (1932) ^Parameter Adaptive Control Algorithm* — A 
Tutorial. 1 * Automatic* IS: (5)513-528 

Kanade, T., P. K, Khosla, and N. Tanaka (1984) December 
^Real-Time Control of CMU Direct-Drive Ann II Using Customized Inverse 
Dynamics. * Prot, of £$rd Con/, on Decision and Contra} Y Las Vegaa, Nevada 

Kawamura, S., F. Miyazald, and S, Arirnoto (1985) Dec n-ir-i 
11 Applications ofT..earning Method for Dynamic Control of Robot Manipulators." 
Prot. Sjth Conf. on Division and Control, Fart Landeidale, Florida 

Kawaimira, S*, F* Miyazaki, and S. Arimoto (1984) Oct 

"Iterative Learning Control For RobDtk Systems.'* ProE, of IECON flf, Tokyo, 
Japan 

Kazerooni, Homayoon (19 85) February * A Robust Design Method For 
Impedance Control of Constrained Dynamic Systems." Ph.- D- Thesis, 
Massachusetts Institute of Technology, Department of Mechanical Engineering 

Khosla, Pradeep Kp, and Takeo Kanade (1985) December 

"Parametfir Identification of Robot DynamicE." 1 Proceeding #/ Sjtk Conference on 
Derivin-n and Control, Ft, Lauderdale, FL, pp \ TJi-li— 1 7@Q 

KoopnnailSj T* (1937) Linear Regression Analysis of Economic Time 
Series . Haarlem, The Netherlands, DeErven F, Bohm 

Kotta, U., (1982) "On-line Eigenvector A Igorithms For The Identification 

of Dynamic Systems, 11 IFAC Conference on Identification and Sfrsitm ParamEter 
Estimation, Washington, DC, USA, pp 1265-1369 

Kotta, U* (1979) ''Structure and Parameter Estimation of Multivarlable 
Systems Using Eigenvector Method." IFAC Conferen.EE on Identification and 
System Parameter Extirpation, pp 453-45S 

Landau T Yoan D. (1979) Adaptive Control: The Model Reference 
x\pproach., New York, Marcel Deltker 

Landau , I. D.j M* Tomizuka, and D. M. An slander, Eds, 
(1983) Adaptive Systems in Control and Signal Processing L983; Proceedings of 
the IFAC Workshop. New York, Pergamon Press 

Lee, K k (1983) December w -Shape Optimization of A ssemblies Using 

Geometric Properties." Ph-D. Thesis, MIT, Department of Mechanical Engineering 
Department 



14S 



Leigh „ J. R„ (1986) Applied Distal Control; Theory, Design, and 
Implementation. Englcwood Cliffs^ NJ, Prentice Hall International 

Levin, M* J., (1984) "Estimation of a System Pulse Transfer Function In 
the Presence of Noise" IEEE Trans, on Auto. ConlK AC-fl: 229-355 

Liegeobj Alain (1985) Robot Technology: Performance and 
Computer-Aided D«*igju En K lewood Clifla T N J., Prentice-Hall, Inc. 

Liegeois, A., Fournier, A., and Aldon, M.J* (1080) August 

13-15 "Model reference control of high-velocity industrial robots.'" Free Joint 
Automatic Control Conf, San Francisco 

Ijungj L. and Soderstrom, T. (1983) Theory and Practice of 

Recursive Idenfication, , MIT Press 

Lull, J.Y.S., Walker, M.W., and Paul, R.P.C. (1980a) 

* Resolved-acceleration control of mechanical manipulatora, 1 * IEEE Trans. Auto. 

Contr. AC-25: (3)468-474 

Luh, J.Y*S* t Walker, M., and Paul, R.F. (1960b) "On-line 

computational scheme for mechanical manipulators." J. Dynamic S\fftcms t Mem.., 
Control I0J: 69-76 

Macmilla?), R* H, (1951) An Introduction to the Theory of Control in 
Mechanical Engineering, Cambridge, England, Cambridge University Frees 

Markiewicz, B. R. (lf>73) "Analysis of the computed Torque Drive 

Method and Comparison with the Conventional Position Servo for a 
Computer-Controlled Manipulator.* 3 Technical Memorandum 33-601 , Jet 
Propulsion Laboratory,, Pasadena, CA. 

Marquardt, D.W., and Snee, R,D* (1975) "Ridge regression in 
practice. 11 Amer, Statistician 2& 3-20 

Marr, David (1977) "Artificial Intelligence - A Personal View.* Artificial 

Intelligence 9: 37-48 

Marr, David (1982) Vision. San Francisco, W r H- Freeman and Co. 

Marshall, S. A. (19T8) Introduction to Control Theory. London, The 
Macmillan Press, Ltd, 

Mayeda, H., Osuka, K., and Kangawa, A. (1984) July a-e «A 

new identification method for serial manipulator arms.* Preprints IFAC &th World 
Congress, Budapest, pp 74-79 



My 



Mita, T,, and Ei KatO (1985) Dec. 11-13 "Iterative Control and its 
Application to Motion Control of Robot Arm - A Direct Approach to 
Servo-Probleme.* Proc. S4th Conf on Decision and Control, Fort Lauderdale, 
Florida 

Moore, J* R. (1951) "Combination Open-Cycle Closed-Cycle Systems. 1 ' 

Proceedings of the l.R.E. 30: 1421-1432 

Morita, Atsushi (February 27) B A Study of Learning Controllers For 
Robot Manipulators With Sparse Data,* M,S, Thesis, Massachusetts Institute of 
Technology,. Department of Mechanical Engineering 

Milker jee, A. (1984) November u Adaptation in biological sensory-motor 
systems: A model for robotic control,* PrfK r) SPIE Conf. on Intelligent Robots 
and Computer Vision, SPIE Vol 5S1, Cambridge 

Mukerjee, A,, and Ballard, D.rL (1985) Mar. 25-2A 

^Self-calibration in robot manipulators." Proe. IEEE Conf, Robotic f and 
Automation) St. Louis, pp 1050-1057 

Neuman, C.P., and Khoala, P.K. (1985) May 29-31 "Identification 

oF robot dynamics: an application of recursive estimation* Proe, lih Yale 

Workshop on Application* of Adaptive SyalcniB Theory, New Haven, pp 42-49 

Ogata, KatSUhiko (1970) Modern Control Engineering. Engl ewood Cliffs, 
N.J,. Frcmkc-Hall, Inc. 

Olsen, ELB., and Bckey, G-A, (1985) Mar. 25-28 "Identification of 

parameters in models of robots with rotary joints* Frvc. IEEE Conf Robotics 
and Automation, St, Louis, pp 1045-L050 

Oppeuheim., A* V., and A. S. Willaky (1983) Signals and 

Systems. Englewood Cliffs, NJ> Prentice-Hall, Inc. 

Orin, D + E+, R + B + McGhee, M. Vukobratovit, and G. 
Hartoch (1979) "Kinematic and Kinetic Analysis of Open-Chain Linkages 
Utilizing Newton-Euler Methods." Mathematical BioscienceB J3: 107-130 

Qwena, D* H. (1978) Feedback and Multi variable Systems. New York, 
Peter Peregrinus 

Paul, R + C + (1972) "Modeling, Trajectory Calculation t and SeJVOing of a 

Computer Controlled Arm. 11 A. I. Memo 177, Stanford Artificial Intelligence 
Laboratory, Stanford University, 

Paul, R,P. (1981) Robot Manipulators; Mathematics, Programming and 

Control. Cambridge, Mass., MTT Press 

150 



Peschon, John (IQfiJi} Disciplines and Techniques of Systems Con trol. New 
York, Blaisdell Publishing Company 

Premingerj J* and J* Rootenberg (1964) "Some Considerations 
Relating to Contra] Systems Employing the Invariance Principle. 11 IEEE 
Transactions on Automatic Control AC-9: 209-215 

Raibert, M. H« (1977) December ''Analytical equations vs. table look-up 
for manipulation: a unifying concept. 1 * Proc, IEEE Con/. Decision and Control, 
New Orleans, La.., pp £76-5 7ft 

Raibert, M. H. (1978) *A model for sensorimotor control and learning. 1 ' 
Biological Cybernetic* 29i 29-36 

Raibert, M + H. and B. K. P. Horn (1978) "Manipulator control 
using the Configuration Space Method/ Industrial Robot 5: (2)69-73 

Raven, FranClS Hi f 1978) Automatic Control Engineering, 3rd Edition 

New Vork, Mcgraw-Hiil Book Company 

Samaoil, C« (1983) Dec, 14-16 "Robust nonlinear control of robotic 

manipulators. 1 ' Proe. £2nd IEEE Con/. Decision and Control, San Antonio 

Saridis, George N. (1977) Self-Organizing Control of Stochastic Systems. 
New York j Marcel Dekker, Inc. 

Saras, Emanuel S. (1965) Computer Control of Industrial Processes. 
New York f Mcgraw-Hill Book Company 

Schumann, R. and II. Christ (1979) June 17-21 "'Adaptive 
Feedforward Controllers for Measurable Disturbances/ Joint Automatic Control 

Conference-, Denver, CO 

Schwarzenbach, J. and K. F. Gill (1984) System Modelling and 

Control. 2nd Edition Baltimore,. MD, Edward Arnold 

Shinskey, F. G. (1963) "Feedforward Control Applied, 1 ' ISA Journal m- r 
[11)01-65 

Shinskeyj F. G. (1979) Process-Control Systems: Application, Design, 

Adjustment. 2nd Edition New Y r ork, McGraw-Hill Book Company 

Sloti lie, J + -J. E. (1985) ^The robust control of robot manipulators." Int. 
J r Robotics Research 4; (2)49-64 

SUvey, S» D. (1975} Statistical Inference. London, Chapman and Hall 

Smith, F* W. (196 8 J System Laplace- Transform Estimation from 

151 



Sampled Data,* IEEE Tram, on Auto. Contr. AC-IS; 37-44 

Spon gl M, W., Thorp, J.S., and Kleinwaks, J.M. (1984) Dec, 

L2 14 The control of robot manipulators with bounded input. Part II: robustness 

and diaturbance rejection." /W SSrdlEBE Cenf. Dteiwm and Controi, Us 

Vegas., pp 1047-1052 

Sweet, L,M-, and Good, M.C. (1984) Dec. 13-14 -Re-definition of 

the robot motion control problem: effecte of plant dynamic* , drive system 

constraints, and user requirements* ft«, ttn* Con/. He* mm ««* CorrfW*, Lu 

Vegas, pp 724-732 

Sweet, L. M., and M. C. Good (1985) "Re-definition of the rgbot 

motion control problem; effect* of plant dynamics, drive system constraint*, and 

uaer refl.ui ^ ements,' , IEEE Control System* Maffaant K: (3)1^25 

Symon, K.R + (1971) Mechanics. Reading, Maas., Addison- Wericy 

Szentagothai, John and Michael A. Arbib (1975) Conceptual 

Models of Neural Orjaniaition. Cambridge, MA, The MIT Press 

Takahashi, Yasundo and Rabins, Michael J. and Anslauder, 

David M, (1970) Control and Dynamic System*. Reading, MA-, 

Addison- Wesley Publishing Company 

Tian-Xing, Y +] L. Duan, Z. Zhong-Jmi [1982) s Re-definition of 

the robot motion control problem: effects of plant dynamics, drive system 

constraints, and user requirements.." Free, of the American Control Conftrtnct, w 

165-169 

Togai, Maaaki and Osamu Yamano (1986) April 7 10 "Learning 

Control and Its Optimality: Analysis and Is Application to Controlling Industrial 

Robots. 11 Free. 1986 IEEE International Conference on Robotics and Automation, 

San Francisco, CA, pp 248-253 

Togai, JvL, and O. Yamano (1986) Dec, U-13 "Analysis and Design 

of an Optimal Learning Control Scheme for Industrial Robota: A Discrete Time 

Approach. * Prat. Sjih Conf. on Decision and Control, Fori Lauderdale, Florida 

Tou, Julius T. (1959) Digital and Sampled-data Control Systems. New 

York, Mcgraw-Hill Book Company, Inc. 

Tiypkin, Ya. 2- (1971) Adaptation and Learning in Automatic Systems. 

Ngw York, Academic Press 

Uchiyama, M. (1978) "Formation of High-Speed Motion Pattern of a 

Mechanical Arm by Trial." Trans, of Society of Instrument and Control Engine*™ 

152 



(Japan) 10: (5)706-712 

Voronov, A. A. and V. Yu. Rutkovsky (1984) "State-of-the-art 

and Prospects of Adaptive Systems." Automatic 20: (5)547 H7 

Vukobratovid, M. arid V, Fotkonjak (1982) Dynamics of 
Manipulation Robots: Theory and Application. New Yo*k a Spring- Verlag 

Vukqbratovic, M. and D. Stokic (1982) Control of Manipulation 
Robots: 1 henry and Application. New York, Spring- Verlag 

Wang, S H. (1984) October Computed Reference Error Adjustment 
Technique (CREATE) For The Control of Robot Manipulators.* Sgnd Annual 

Aikrton Conf on Communication, Control, and Computing 

Wang, S. H«, and I. Horowitz (1985) March "CREATE - A New 
Adaptive Technique," Prat, of the Nintteenth Annual 0*ttf. on Information 
Sciences and Systems 

Wohbj C. FL (1964) Automatic Control: An Tntrod action. New York 
McGraw-Hill Book Company * 

Weyrick, Robert C. (19T5) Fundamental of Automatic Control. New 
York, Mc.G raw-Hil | Book Company 

Whitney, D.E., Loiinski, C.A., and Rourke, J.M + (1984) 

"Industrial Robot Calibration Method and Results. 71 CSDL-P-1S79, Charles Stark 
Draper Laboratory Cambridge, Mass.. 

^7' Bernard, Ey K finE Walach, and Shimiel Shaffer 

IT J ° CL 31 " NoV 2 " Adap ' 1VL ' SiRral Preccsa 'ng for Adaptive Control ™ 
Proceedmsa of the Seventeenth Asilomar Confer*™* on Cireuiu, Systems & 
Computers, Santa Clara, CA, pp 265-270 

Widrow, B- and E. Walatn (1583) June 20-23 "Adaptive Slgna] 
Processing for Adaptive Control.* Proceedings of the IFAC Workshop? Adaptive 
Sttttems m Control and Signal Processing! im, San Francisco, pp 7-12 

Widrow, B. and others (1081) NWmber "On Adaptive Wr Ee 
Control. Proceedings of the Fifteenth Asilomar Conference on Circuits, Systems 
& Computers 

Widrow, B, s John M. McCool, and Barry P. Medoff (1978) 
Nov. 6 8 * Adaptive Control by Jnvtise Modeling* Proceedings of Twelfth 
A«fc m0 f Conference on Circuits, Systems, & Computers, Pacific Grove, CA, pp 

2f U y ^ 



153 



Winston, Patrick Henry (1984) Artificial Intelligence [2nd edition). 
Reading, MA S Addison Wesley 

Wittenmark B. (19TS) "Stochastic Adaptive Control Methods A 
Survey. 11 Int. J r Control 21; (5)705-710 

Wittenmark, B (1973) "A Self-tuning Regulator." ?3ll, Dtpt. of 
Automatic Contra], Lund Institute of Tttkitoltw, Lund, Sweden. 

Wpolverton, P. F. and R W, Murrill (196T) -An Evaluation of 
Four Uena in Feedforward Control." Instrumentation T^hnotogy 14: (lj 

Yale: (1985) "Proceeding* of The Fourth Yale Workshop on Applications of 
Adaptive Systeme Theory." Center for SyHt^ma Sdence, Becton Center, Yaie 

University, New Haven, Conn,. 



151 



